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Editorial 


Eureka 62 


hen the Archimedeans asked me to edit Eureka for the third 
time, I was a bit sceptical. Issue 60 was the first to get a pa- 
perback binding and issue 61 was the first to be published in 


full colour and with a new design. How could we make this issue special 
— not just a repeat of the previous one? 


Eureka has always been a magazine for students, not a research journal. 
Articles should be interesting and entertaining to read, and often they 
are a stepping stone into particular problems or areas of mathematics 
which the reader would not usually have encountered. 


Every year we receive many great articles by students and mathemati- 
cians. Our task as editors is often to make them more visually appealing 
- and we can do so using images, diagrams, fonts or colours. 


What we wanted to add in this issue was interactivity, such as videos, 
slideshows, animations or games. Unfortunately this still is quite dif- 
ficult on paper, so we decided to publish a second version of Eureka as 
interactive eBook for mobile devices like iPad. And we hope that this 
will make reading mathematics even more engaging and fun. 


The digital version will, for the first time, make Eureka available to a 
large number of students outside Cambridge. And therefore we have 
reprinted some of the best articles from previous issues. We spent many 
hours in the library archives, reading old copies of Eureka, though of 
course there are many more great articles we could have included. 


The articles in this issue are on a wide range of topics - from num- 
ber theory to cosmology, from statistics to geometry. Some are very 
technical while others are more recreational, but we hope that there is 
something interesting for everyone. 


I want to thank the editorial team for all their work, and the authors for 
their excellent articles. We hope you will enjoy reading Eureka 62! 
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The Archimedeans 


Yuhan Gao, President 2012 — 2013 


/ | his year was yet another highly successful 
one for The Archimedeans. The society 
welcomed over 150 new members, courtesy 

of a very popular Freshers’ Squash. We hosted a 

number of talks given by speakers from the uni- 

versity over the course of Michaelmas and Lent. 

These covered a number of different topics, cater- 

ing for those with interests in pure, applied and 

applicable mathematics. Highlights included 
talks by Prof. Grae Worster on Ice, and Prof. Imre 

Leader on Games of Pursuit and Evasion. 


The society expanded the range of events which 
we offered to our members this year. We held a 
board games evening, which proved to be a thor- 
oughly enjoyable night for all those who attended. 
One of our most anticipated events was the black- 
tie Annual Dinner in the delightful surroundings 
of the Crowne Plaza Hotel. 


A tradition of the Archimedeans is to hold an an- 
nual Problems Drive. This time around, teams 
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Smallest composite number. Number of 
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from as far afield as Oxford came to take part in an 
engaging and entertaining mathematics competi- 
tion. Prizes were awarded not only for the teams 
with the highest scores, but also for particularly 
creative team names. The questions given can be 
found in this journal, and we welcome you to try 
them yourself. 


The year finished on a high in May Week, courtesy 
of the Science and Engineering Garden Party. Six 
societies from the university joined together to 
host a brilliant afternoon of fun, aided by a jazz 
band. Finger food and Pimm’s was served, and 
there was even a cheese bar on offer. 


We would like to thank our members for contrib- 
uting to an excellent year for the society. I would 
also like to thank the committee for all of their 
hard work, and Philipp Kleppmann, last years’ 
President, along with the previous committee, for 
everything which they have done for the society. 
We look forward to another exciting year ahead. 
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Talking to C 
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roughly half my life nurturing one particular 


I love computer languages. In fact, I’ve spent 
very rich computer language: Mathematica. 


But do we really need computer languages to tell 
our computers what to do? Why can’t we just use 
natural human languages, like English, instead? 


If you had asked me a few years ago, I would 
have said it was hopeless. That perhaps one could 
make toy examples, but that ultimately natural 
language just wouldn't be up to the task of creat- 
ing useful programs. 


But then along came Wolfram|Alpha in which 
we've been able to make free-form linguistics 
work vastly better than I ever thought possible. 


But still, in Wolfram|Alpha the input is essen- 
tially just set up to request knowledge - and 


Smallest perfect number. Hexagonal tilings give 
the densest ‘sphere’ packing in two dimensions. 


ale). 


Wolfram|Alpha responds by computing and 
presenting whatever knowledge is requested. But 
programming is different. It is not about gen- 
erating static knowledge, but about generating 
programs that can take a range of inputs, and 
dynamically perform operations. 


The first question is: how might we represent 
these programs? In principle we could use pretty 
much any programming language. But to make 
things practical, particularly at the beginning, 
we need a programming language with a couple 
of key characteristics. 


The most important is that programs a user might 
specify with short pieces of natural language 
must typically be short - and readable - in the 
computer language. Because otherwise the user 
won't be able to tell - at least not easily - whether 


the program that’s been produced actually does 
what they want. 


A second, somewhat related criterion is that it 
must be possible for arbitrary program frag- 
ments to stand alone - so that large programs 
can realistically be built up incrementally, much 
like a description in natural language is built up 
incrementally with sentences and the like. 


To get the first of these characteristics requires 
a very high-level language, in which there are 
already many constructs already built in to the 
language - and well enough designed that they 
all fit together without messy “glue” code. 


And to get the second characteristic essentially 
requires a symbolic language, in which any piece 
of any program is always a meaningful symbolic 
expression. 


Conveniently enough, there is one language that 
satisfies rather well both these requirements: 
Mathematica! 


The linguistic capabilities of Wolfram|Alpha give 
one the idea that one might be able to under- 
stand free-form natural language specifications 
of programs. Mathematica is what gives one the 
idea that there might be a reasonable target for 
programs generated automatically from natural 
language. 


For me, there was also a third motivating idea - 
that came from my work on A New Kind of Sci- 
ence. One might have thought that to perform 
any kind of complex task would always require 
a complex program. But what I learned in A New 
Kind of Science is that simple programs can often 
do highly complex things. 


And the result of this is that it’s often possible to 
find useful programs just by searching for them 
in the computational universe of possible pro- 
grams - a technique that we use with increas- 
ing frequency in the actual development of both 
Wolfram|Alpha and Mathematica. 
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And it was this that made me think that - even if 
all else failed - one might be able to “synthesize” 
programs from natural language just by search- 
ing for them. 


OK - so there are reasons to hope that it might be 
possible to use natural language input to do pro- 
gramming. But can one actually make it work? 


Even when Wolfram|Alpha was launched, I 
still wasn’t sure. But as we worked on bringing 
Wolfram|Alpha together with Mathematica, I got 
more and more optimistic. 


And with Mathematica 8 we have launched the 
first production example. It is certainly not the 
end of the story, but I think it’s a really good 
beginning. And I know that even as an expert 
Mathematica programmer, I’ve started routinely 
using natural language input for certain steps in 
writing programs. 


One can also specify programs in natural lan- 
guage to apply to things one has constructed in 
Mathematica. Andina Mathematica session, one 
can discard the natural language and just use the 
generated code by clicking that code. Some inter- 
esting examples are shown above. 


Now, of course, there are many issues - for exam- 
ple about disambiguation. But the good news is 
that we’ve got schemes for addressing these that 
we've been able to test out well in Wolfram|Alpha. 


Number of bits in a byte. Side length of a 
chess board. Atomic number of oxygen. 
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I have to say that something I thought would be a 
big issue is the vagueness of natural language. That 
one particular natural language input might equal- 
ly well refer to many different precise programs. 


And I had imagined it would be a routine thing 
to have to generate test examples for the user in 
order to be able to choose between different pos- 
sible programs. 


But in reality this seems to be quite rare: there is 
usually an “obvious” interpretation, that in typi- 
cal Wolfram|Alpha style, one can put first, with 
the less obvious interpretations a click away. 


So how well does this all work? We have built out 
some particular areas of program functionality, 
and we will progressively be building out many 
more as time goes on. 


They are primarily set up to work in Mathemat- 
ica. But actually you can see most of them in 
some form just on the Wolfram|Alpha website 
- though obviously no references to variables or 
other parts of a Mathematica session can be used. 


How robust is it all? It’s definitely usable, but I 
would certainly like it to be more robust - and we 
will be working hard in that direction. 


One issue that we have faced is a lack of linguistic 
corpora in the area. We’ve scoured a couple of 
decades of our own tech support logs, as well as 


many programming forums, to try to find natu- 
ral language descriptions matched with precise 
programs. But we haven't be able to apply any- 
thing like the same level of automatic filtering 
to this process as we've been able to apply in 
many other areas of “linguistic discovery” for 
Wolfram|Alpha. 


There are zillions of fascinating research projects 
to do in figuring out generalized grammars for 
specifying different kinds of programming con- 
structs in natural language - and I will look for- 
ward to seeing this field of inquiry develop. 


We now have another important source of data: 
actual examples of natural language program- 
ming being done in Mathematica. And look- 
ing at our real-time monitoring system for the 
Wolfram|Alpha server infrastructure, I can see 
that very soon we are going to have a lot of data 
to study. 


How far will it be possible to get with natural 
language programming? Even six months ago 
I thought it was only going to be possible to do 
fairly simple examples. But seeing what we have 
actually been able to build, I am extremely opti- 
mistic about what will be possible. 


The hope would be that in the end one will just 
have to describe in natural language the goal for 
one’s program - and then an actual program that 
achieves that goal will be synthesized. Some- 
times this will directly be possible from under- 
standing the specification of the goal. Sometimes 
to create the necessary program will require a 
whole program-creation process — probably of- 
ten involving searching for an appropriate pro- 
gram ina space of possible programs, in the style 
of A New Kind of Science. 


It will be important to do program simplification 
- again often achieved by program search - in or- 
der to be able to get the simplest and most read- 
able (and perhaps the most efficient) program 
that meets the requirements that have been given. 


At this point, I am still concerned about how 
much of this will be possible in “interactive times” 
of a few seconds. But if history is a guide, with 
good algorithms and heuristics, and a healthy 
dose of large-scale parallelism, it'll gradually be 
possible to get the times down. 


So what will be the result? I expect natural lan- 
guage programming will eventually become 


This article is reprinted from his blog at 


blog.stephenwolfram.com with kind 
permission of Stephen Wolfram. 


ubiquitous as a way of telling computers what to 
do. People will be able to get started in doing pro- 
gramming-like tasks without learning anything 
about official “programming” and programming 
languages: they'll just converse with their com- 
puters as they might converse with another per- 
son. 


What will happen to programming languages? 
Actually, I think they’ll become much more vis- 
ible and widely known than ever before. Because 
in natural language programming interfaces one 
will probably be shown the programming lan- 
guage code that’s being synthesized. 


People will see that, and gradually learn cases 
where it’s much faster and more precise just 
to enter code like that directly, without going 
through natural language. 


By the way: in Mathematica we are beginning to 
have code generation capabilities for low-level 
languages like C. So it’s going to be technically 
possible to go all the way from natural language 
input down to something like C. And for some 
practical purposes - especially with embedded 
systems - that will no doubt be quite useful. 


But when it comes to doing traditional program- 
ming alongside natural language programming, 
there’s going to be a great premium on having a 
succinct readable programming language - like 
Mathematica. 


With the free-form linguistics of Mathematica 
we are at the first step in a long journey. But it is a 
journey I’m now confident we can take. After so 
many years, the science-fiction concept of being 
able to tell a computer what to do by using plain 
human language is gradually going to become 
reality - in a way that fascinatingly coexists with 
what’s been achieved in high-level computer lan- 
guages. 


Number of muses in Greek mythology. 
Exponential factorial, since 9 = a2 
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Pentaplexity 


Sir Roger Penrose 


First published in issue 39, 1978 


ertain shapes, when matched correctly, 
( can form a tiling of the entire plane but in 

a non-periodic way. These tilings have a 
number of remarkable properties, and I shall give 
here a brief account explaining how these tiles 
came about and indicating some of their proper- 
ties. 


‘The starting point was the observation that a regu- 
lar pentagon can be subdivided into six smaller 
ones, leaving only five slim triangular gaps. This is 
familiar as part of the usual “net” which folds into 
a regular dodecahedron, as shown in Figure 1. 
Imagine now, that this process is repeated a large 
number of times, where at each stage the penta- 
gons of the figure are subdivided according to the 
scheme of Figure 1. There will be gaps appearing 


A Figure 1 A Figure 2 


of varying shapes and we wish to see how best to 
fill these. At the second stage of subdivision, dia- 
mond-shaped gaps appear between the pentagons 
(Figure 2). At the third, these diamonds grow 
“spikes’, but it is possible to find room, within 
each such “spiky diamond’, for another pentagon, 
so that the gap separates into a star (pentagram) 
and a “paper boat” (or Jester’s cap?) as shown in 
Figure 3. At the next stage, the star and the boat 
also grow spikes, and, likewise, we can find room 
for new pentagons within them, the remaining 
gaps being new stars and boats (as before). These 
subdivisions are shown in Figure 4. 


Since no new shapes are now introduced at sub- 
sequent stages, we can envisage this subdivision 
process proceeding indefinitely. At each stage, the 
scale of the shapes can be expanded outwards 
so that the new pentagons that arise become the 


*2 ¢ 


A Figure 3 


First Apollo mission to land of the moon. 
Number of space-time dimensions in M-theory. 
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A Figure 4 


A Figure 5 


A Figure 6 


Number of signs in the Zodiac. Number of days of 
Christmas. Number of apostles. Sublime number. 


same size as those at the previous stage. As things 
stand, however, this procedure allows ambiguity 
that we would like to remove. The subdivisions of 
a “spiky diamond” can be achieved in two ways, 
since there are two alternate positions for the pen- 
tagon. Let us insist on just one of these, the rule 
being that given in Figure 5. (When we examine 
the pattern of surrounding pentagons we neces- 
sarily find that they are arranged in the type of 
configuration shown in Figure 5.) It may be men- 
tioned that had the opposite rule been adapted for 
subdividing a “spiky diamond’, then a contradic- 
tion would appear at the next stage of subdivision, 
but this never happens with the rule of Figure 5. 


This procedure, when continued to the limit, leads 
to a tiling of entire plane with pentagons, dia- 
monds, boats and stars. But there are many “in- 
correct” tilings with the same shapes, being not 
constructed according to the above prescription. 
In fact, “correctness” can be forced by adopting 
suitable matching rules. The clearest way to de- 
pict these rules is to modify the shapes to make a 
kind of infinite jigsaw puzzle, where a suggested 
such modification is given in Figure 6. It is not 
hard to show that any tiling with these six shapes 
is forced to have a hierarchical structure of the 
type just described. 


Properties of these Tilings 


Furthermore, the forced hierarchical nature of 
this pattern has a number of very remarkable 
properties. In the first place, it is necessarily non- 
periodic (i.e. without any period parallelogram). 
More about this later. Secondly, though the com- 
pleted pattern is not uniquely determined - for 
there are 2*° different arrangements - these dif- 
ferent arrangements are, in a certain “finite” sense, 
all indistinguishable from one another! Thus, no 
matter how large a finite portion is selected in one 
such pattern, this finite portion will appear some- 
where in every other completed pattern (infinitely 
many times, in fact). Thirdly, there are many un- 
expected and aesthetically pleasing features that 
these patterns exhibit (see Figure 7). For exam- 
ple, there are many regular decagons appearing, 
which tend to overlap in places. Each decagon is 
surrounded by a ring of twelve pentagons, and 
there are larger rings of various kinds also. Note 
that every straight line segment of the pattern ex- 
tends outwards to infinity, to contain an infinite 
number of line segments of the figure. The hier- 


A Figure 7 


archical arrangement of Figure 7 is brought out 
in Figure 8. 


After I had found this set of six tiles that forces 
non-periodicity, it was pointed out to me (by Si- 
mon Kochen) that Raphael Robinson had, a num- 
ber of years earlier, also found a (quite different) 
set of six tiles that forces non-periodicity. But it 
occurred to me that with my tiles one can do bet- 
ter. If, for example, the third “pentagon” shape is 
eliminated by being joined at two places to the 
“diamond” and at one place to the bottom of the 
“boat’, then a set of five tiles is obtained that forces 
non-periodicity. It was not hard to reduce this 
number still further to four. And then, with a lit- 
tle slicing and rejoining, to two! 


The two tiles so obtained are called “kites” and 
“darts”, names suggested by John Conway. The pre- 
cise shapes are illustrated in Figure 9. The match- 
ing rules are also shown, where vertices of the 
same colour must be placed against one another. 
There are many alternative ways to colour these 
tiles to force the correct arrangement. One way 
brings out the relation to the pentagon-diamond- 
boat-star tilings shown in Figure 10. A patch of 
assembled tiles (partly coloured in this way) is 
shown in Figure 11. The hierarchical nature of the 
kite-dart tilings can be seen directly, and is illus- 


A Figure 8 


trated in Figure 12. Take any such tiling and bisect 
each dart symmetrically with a straight line seg- 
ment. The resulting half-darts and kites can then 
be collected together to make darts and kites on 
a slightly larger scale: two half-darts and one kite 
make a large dart; two half-darts and two kites 
make a large kite. It is not hard to convince one- 
self that every correctly matched kite-dart tiling 
is assembled in this way. This “inflation” property 
also serves to prove non-periodicity. For suppose 
there were a period parallelogram. The corre- 
sponding inflated kites and darts would also have 
to have the same period parallelogram. Repeat 
the inflation process many times, until the size 
of the resulting inflated kites and darts is greater 
than that of the supposed period parallelogram. 
This gives a contradiction. 


The contradiction with periodicity shows up in 
another striking way. Consider a very large area 
containing d darts and k kites, which is obtained 
referring to the inflation process a large number 
of times. The larger the area, the closer the ratio 
x = k/d of kites to darts will be to satisfying the 
recurrence relation x = (1 + 2x)/(1 + x), since, 
on inflation, a dart and two kites make a larger 
kite, while a dart and a kite make larger dart. This 
gives, in the limit of an infinitely large pattern, 


A torus can be sliced into 13 pieces with just 
three plane cuts. Number of Archimedean solids. 
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A Figure 9 
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A Figure 10 


x= , (1 + V5) = g, the golden ratio! Thus we 
get an irrational relative density of kites to darts 
- which is impossible for a periodic tiling. (This 
is the numerical density. The kite has ¢ times the 
area of the dart, so the total area covered by kites 
is g? (= 1+) times that covered by darts.) 


Jigsaws and beyond 


There is another pair of quadrilaterals which, 
with suitable matching rules, tiles the plane only 
non-periodically: a pair of rhombuses as shown 
in Figure 13. A suitable shading is suggested in 
Figure 14, where similarly shaded edges are to be 
matched against each other. In Figure 15, the hier- 
archical relation to the kites and darts is illustrat- 
ed. The rhombuses appear mid-way between one 
kite-dart level and the next inflated kite-dart level. 


Many different jigsaw puzzle versions of the 
kite-dart pair or the rhombus pair can evidently 
be given. One suggestion for modified kites and 
darts, in the shape of two birds, is illustrated in 
Figure 16. 


Other modifications are also possible, such as 
alternative matching rules, suggested by Robert 
Ammann (see Figure 17) which force half the tiles 
to be inverted. 


Open meandric number. These are the number of 
meanders in non-self-intersecting oriented curves. 


A Figure 11 


Many intriguing features of these tilings have 
not been mentioned here, such as the pentago- 
nally-symmetric rings that the stripes of Figure 
14 produce, Conway’s classification of “holes” in 
kite-dart patterns (ie. regions surrounded by 
“legal” tilings but which cannot themselves be 
legally filled), Ammann’s three-dimensional ver- 
sion of the rhombuses (four solids that appar- 
ently fill space only non-periodically), Ammann’s 
and Conway’s analysis of “empires” (the infinite 
system of partly disconnected tiles whose posi- 
tions are forced by a given set of tiles). It is not 
known whether there is a single shape that can tile 
the Euclidean plane non-periodically. For the hy- 
perbolic (Lobachevski) plane a single shape can 
be provided which, in a certain sense, tiles only 
non-periodically (see Figure 18) - but in another 
sense a periodicity (in one direction only) can oc- 
cur. (This remark is partly based on suggestions of 
John Moussouris.) 


References 
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Squared Squares 
Philip Kleppmann, Corpus Christi 


smaller squares, all of which have different 

sidelengths. Such rectangles are called squared 
rectangles. Of course, rectangles like this one can be 
constructed by trial and error if you have enough time 
or a computer. The task becomes harder if you try to 
produce a squared square. The challenge of finding 
one arose in the early twentieth century from a prob- 
lem in a mathematical puzzle book called The Can- 
terbury Puzzles [5]. It wasn’t even clear that a squared 
square existed, until R. Sprague found one in 1939 [3], 
more than 30 years later. 


F igure 1 shows a rectangle that is dissected into 


In the 1930s, the four Cambridge undergraduates Ro- 
land Brooks, Cedric Smith, Arthur Stone, and William 
Tutte came across this problem and devised some very 
clever methods of producing squared rectangles and 
squares using the theory of electrical networks, some 
of which I will present here. The present-day logo of 
the Trinity Mathematical Society is a squared square, 
in recognition of the four Trinity students. 


The low-tech method 


Draw a rectangle cut up into smaller rectangles, as in 
Figure 2. Squint at it and imagine that it is just a bad 
drawing of a squared rectangle. Assign values x and y 
to the sidelengths of two of the ‘squares’ as shown in 
the figure. From these it is easy to determine all other 
sidelengths: First x + y above the two starting squares, 
then 2x + y to the left, and so on. We have to make sure 
that the two vertical sides of the big rectangle have the 
same length. For this we need (5x + 3y) + (8x + 4y) = 
(4x + 4y) + (4x + Sy), Le. 5x = 2y. So, taking x = 2 and 
y = 5, we get the squared rectangle in Figure 1. 


Number of pawns ina chess set, and each player 
starts with 16 pieces. Length of bank card numbers. 


A Figure 2 A badly drawn squared rectangle 


This method was used by Arthur Stone to con- 
struct his first squared rectangle [4]. While it is 
easy to apply, it is also luck-dependent. You can't 
count on finding a dissection that can become a 
squaring, and sometimes the equations give nega- 
tive values for some sidelengths. This makes a sys- 
tematic analysis very difficult. For example, if one 
is interested in the smallest number of squares 
that a rectangle can be cut up into, it is not at all 
clear how to show that there are no smaller ones. 
In fact, the smallest number is 9. This was proved 
in [1] using the following more refined method. 


The high-tech method 


Suppose we have a squared rectangle, such as the 
one in Figure 1. We construct a directed graph 
with a vertex for each horizontal line segment and 
an edge for each square. There is an arrow from 
a vertex v to a vertex w if and only if the corre- 
sponding horizontal line segments V and W in 
the rectangle are connected by a square and V is 
above W. We label the arrow with the sidelength 
of this square. Figure 3 illustrates the procedure. 


The graph in Figure 4 is constructed in this way 
from the rectangle in Figure 1. It is called the 
Smith diagram of the rectangle. 


Now call P and Q the poles of the network, and 
interpret the labels of the edges as currents. There 
are a couple of things in the graph that you may 
notice: 


1. For any vertex that isn’t a pole, the sum of 
currents entering it is equal to the sum of 
currents flowing out of it: The sum of the 
sidelengths of squares lying directly above 
one horizontal line segment in the squared 
rectangle is the same as the sum of the 
sidelengths of squares lying directly below it. 


2. The sum of currents around any circuit 
is zero (counting currents in the ‘wrong’ 
direction as negative currents). This is 
because the length of any straight vertical 
path from one horizontal line segment to 
another one is the same, no matter which 
squares it passes through. 


3. The sum of the currents leaving P is equal 
to the sum of the currents entering Q, since 
the lengths of the two horizontal sides of 
the rectangle are equal. 


A Figure 3 How to make a Smith diagram 
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A Figure 4 The full Smith diagram 


And - hey presto! - we've built an electric network 
in which the given currents are valid as long as we 
assume that each wire has unit resistance. (1) and 
(2) are called Kirchhoff’s laws. 


In fact, this construction works in the other direc- 
tion as well: If we construct an electric network 
satisfying the three conditions above and which 
has different currents along all of its wires, then 
it is a blueprint for a squared rectangle! Of course, 
the network encapsulates the same information as 
the squared rectangle, but it has many advantages 
over the first method. Graphs are well-established 
mathematical objects, so we can fall back on a 
large body of theory. In particular, the theory of 
electric networks can be used for further investi- 
gations. See [1] for more. 


Graphs can be searched systematically. For exam- 
ple, to show that there are no squared rectangles 
consisting of fewer than 9 squares we can search 
all directed graphs with at most 8 (and at least 2) 
edges and try to assign distinct values to the edges 
in such a way that the three conditions are sat- 
isfied. But there aren't any [1]! The rectangle in 
Figure 1 consists of 9 squares, so it is proved that 
the smallest number of squares in a squared rec- 
tangle is 9. 


The “least random number”. 
Number of wallpaper groups. 
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A squared square 


One way of finding a squared square is to exhaus- 
tively search through all squared rectangles, until 
you spot one. The smallest has 21 pieces [3], so 
this might take a while. It turns out that looking 
for a squared rectangle that can be cut up into 
squares in two completely different ways (mean- 
ing that none of the squares of one dissection ap- 
pear in the other dissection) will get you there a 
lot faster. 


The smallest such rectangle is 422 x 593 and can 
be cut up into 13 pieces in two different ways [4]. 
The sidelengths of the component squares are 18, 
38, 49, 67, 72, 85, 103, 116, 154, 175, 192, 230, 247 
and 2, 22, 37, 39, 41, 43, 80, 164, 178, 200, 207, 
215, 222, respectively. The two rectangles are 
combined with two squares to form one large 
squared square, as shown in below. You might 
like to assign the sidelengths to the component 
squares yourself! 


Sum of the first three pentagonal numbers, 
and thus a “pentagonal pyramidal number”. 
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Tricky Teacups 


Vito Videtta, Trinity Hall 


ere is a puzzle I found when volunteering 
H at maths outreach events in Cambridge: 
it is called Aunty’s Teacups. We are given 
16 teacups, four each in each of four colours, and, 


similarly, 16 saucers. Arrange the cups on top of 
the saucers in a 4 by 4 square grid so that: 


1. in each row and column, there is one cup of 
each colour; 


2. in each row and column, there is one saucer 
of each colour; 


3. (Orthogonality Condition) no cup-saucer 
colour combination is repeated. (To be clear, 
this means that, for example, red-on-green 
and green-on-red are both allowed.) 


I was rather intrigued when I first saw this puzzle; 
it appeared to be so simple, yet everybody who 
tried it quickly found out it was a Pandora’s Box. 
It was apparent that I was hooked as soon as I got 
home; I immediately started working on a solu- 
tion. Before we get to that though (and to give you 
a chance to try it for yourself), let’s go through 
some of the history of this problem. 


Square arrangements of the above type were first 
studied by LEONHARD EUuLER. In a seminal paper 
published in 1782, he poses the following prob- 
lem: ‘Given a group of 36 officers of six different 
ranks, one each from six different regiments, is it 
possible to arrange the officers in a square, in such 
a manner that in each line, vertical or horizontal, 
there is one officer of each rank and one from each 
regiment?’ This problem came to be known as “Le 
Probleme des 36 Officiers”; despite its apparently 
simple formulation, it turned out to be one of the 


The number of quarter or half turns required to 
optimally solve a Rubik's cube in the worst case. 


hardest mathematical problems ever posed. Con- 
sequently, this led to a barrage of new mathemat- 
ics being created as more and more mathemati- 
cians tried to rise up to Euler’s Challenge. 


As with much great mathematics, this particular 
problem managed to find its way into popular cul- 
ture in the form of many puzzles. One version that 
is particularly simple to set up requires a pack of 
cards. Take the Ace, King, Queen and Jack of each 
suit. Arrange the cards in a 4 by 4 square grid so 
that in each row and column, there is one card 
of each rank and one of each suit. You can easily 
see that this puzzle is equivalent to the Teacups 
problem (in fact, it is slightly easier, since we don't 
have to worry about the orthogonality condition). 


Journey towards a solution 


A Latin square of order n is an nXn matrix con- 
taining n copies of the numbers 1 to n arranged 
so that in each row and column, each number 
appears once and only once. Latin squares arise 
naturally as the multiplication tables of finite 
groups (of course, the symbols we use to label the 
square are unimportant). Suppose A and B are 
Latin squares of order n; we say that A and B are 
orthogonal if (A;, B;) ranges through all possible 
ordered pairs as i, j range through all legal indices. 
Moreover, we call B an orthogonal mate to A. Ina sense, 
orthogonal Latin squares (OLSs) are Latin squares 
that are as different as possible from each other. 


We now see how to translate the Aunty’s Teacups 
problem into mathematics; we are merely search- 
ing for a pair of OLSs of order 4. 


I began my investigation by, rather shamefully, 
writing a computer program to find all possible 
solutions. I started by labelling the four colours 
using the numbers 1 to 4 and using the notation 
(c, s) to describe the entries in the 4-by-4 matrix, 
where c is the cup colour and s is the saucer colour. 
After five hours of coding and a split second of 
computing, it spat out a completely unintelligible 
sequence of the numbers 1 to 4 and, somewhat 
more importantly, the number of solutions it had 
found: 6912. That was nice, but uninformative. 


And so I left the problem there for a while, in 
which time I finished my first two years at Uni- 
versity. But when I showed the puzzle to a young 
girl and her father at another outreach event, an 
obvious fact hit me with more force than a meteor 
strike: given any solution, I could cyclically per- 
mute the rows and columns to generate new solu- 
tions. Topologically then, the grid can be wrapped 
around into a tube and the ends connected to 
make a doughnut; a sort of “Teacup Torus”. 


In fact, I could act on the solution space X by the 
group S,xS, via row and columns swaps on any 
solution. I called this group the Automorphism 
group of X. (This terminology is non-standard; 
I chose it purely by analogy with Galois theory.) 
This allowed me to define an equivalence relation 
~ on the space of solutions with the equivalence 
classes precisely the orbits of the action: we say x ~ 
y iff there is 0 € S,xS, such that x = o(y) where o = 
(0,, 02) acts on y by permuting the rows via o, and 
the columns via 0. Having defined this equiva- 
lence relation, it then became very natural to ask 
if there was a convenient choice of representa- 


for each equivalence class. The answer here 
is ‘yes’; by permuting rows and columns, I could 
always transform any given solution to the form 


(ERS?) | (352). (488) 
(ley 2 2 
Geel 2 2 2 
(4,2) |? 2 2 


Moreover, the action has the following highly de- 
sirable property: if x, y ¢ X are distinct solutions 
in the same orbit, then there is a unique o € S,x S, 
such that x = o(y). Hence, each equivalence class 
has size (4!)? = 576 and so this gives 6912/576 = 
12 different equivalence classes. This was a ma- 
jor step towards the solution, but I was still not 
satisfied; 12 was “too big” and I felt that I could 
quotient out more group actions from the solu- 
tion space. 


Again, progress fell silent as I was revising for my 
final exams. The final piece of the puzzle came 
to me as I walked back to college along Burrell’s 
Walk one evening early in May. I had been revis- 
ing Galois theory, which got me thinking about 
groups and (inevitably) a certain puzzle involving 
pieces of fine china. I was thinking about how I 
could incorporate permutations of colour into my 
solution; clearly, I had completely exhausted all 
possible permutations of rows and columns and 
so colour permutations stood as the final frontier. 
The problem I faced was that in the top-left hand 
corner of the above scheme, colour 1 had already 
been fixed in the second coordinate. It occurred 
to me while I was walking that, actually, this did 
not present a problem: just permute the remaining 


Smallest Fibonacci number whose digits and digit 
sum are also Fibonacci. Magic number in Blackjack. 
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colours in the second coordinate among themselves. 
By doing so, I could force any solution to assume 
the form 


(1,1) | (2,2) | (2) | (42) 
(29) | 2 2 2 
(3,3) |? 2 2 
(4,4) | 2 2 2 


Hence, I had the final automorphism group: 
Aut(X) = S,xS,xS3, a group of size 3456. It re- 
mained to find representatives for the equivalence 
classes. After a bit of searching, they presented 
themselves: 


(41) | (23) | 34) | (42) 
(2,2) | (44) | (443) 13D 
“a \ (3,3) | (41) | (2) | (2,4) 
(4,4) | (3,2) | (2,1) | (1,3) 
(1) | (2,4) | 3,2) | (43) 
(2,2) | (4,3) | (41) | (3,4) 
me (3,3) | (4.3) | 4) | (0) 
(44 | a) (3) 42) 


This was exactly what I was expecting: there are 
two equivalence classes resulting from this action. 
Also, the above solutions lie in different orbits, 
which preserves the uniqueness of action. Hence, 
I had 2x 3456 = 6912 solutions altogether, which 
agreed with my computer search. 


Designing Experiments 


All of the above discussion has a distinctly pure 
flavour and may have made some of my applied 
readers slightly nauseous. Fear not my friends, we 
now present an application of the above theory. 


Suppose that the brilliant genius Prof. Tarquin 
Walter Kornman is organising examinations for 
his students at Camford University. He is cur- 
rently arranging an exam timetable for his four 
students, Alice, Daniel, Grace and Timothy, who 
must each take four papers. Tarquin aims to de- 
sign a timetable that is as efficient as possible, but 
considers it a form of cruelty to force a student to 
sit more than one paper per day. However, in spite 
of his compassion, his genius lends him certain 
eccentricities, chief among which is an extreme 
desire to eliminate bias as far as possible. He does 
so by choosing, for each day, four different exami- 
nation start times. Given this information, how 
should Tarquin finish the exam timetable? 


Number of partitions of 8. Pentagonal and a centred 
heptagonal number. 22/7 is an approximation for 7. 


Here, Tarquin has four collections of symbols, or 
treatments, to deal with: Start Time, Exam Date, 
Student Name and Paper Number. It turns out 
that the most efficient and unbiased arrangement 
uses a pair of OLSs of order 4, preferably chosen 
at random. All that Tarquin need do is to specify 
which treatment will label the rows, which will la- 
bel the columns and finally to use the other two 
treatments as sets of symbols with which to fill the 
square. For example, one solution might be to use 
the Student Names as row labels, the Exam Dates 
as column labels and then to fill the table with the 
Paper Numbers and Start Times. A similar strat- 
egy will work for any similar experiment; such 
arrangements are called Pairwise Balanced Designs 
and are frequently used to ensure efficiency and 
elimination of bias when designing an experiment. 


Now, suppose that Tarquin is dissatisfied with the 
way the examinations were run this year. He be- 
lieves that the students had not been examined 
thoroughly enough, so decides to make the exams 
harder by introducing two new papers. By sheer 
luck, two new students join the course the fol- 
lowing year, so that now we have six papers to be 
taken by six students over six days, each at six dif- 
ferent starting times. Tarquin, now knowing that 
the key lies in orthogonal latin squares, proceeds 
to construct a solution. However, after many long 
and painful attempts, he is reduced to a blubber- 
ing wreck on his office floor, since he is unable 
to find one. He has convinced himself of his own 
stupidity, but he need not be so harsh on himself... 


Euler’s Conjecture: 
or, Tarquin redeemed 


Earlier I mentioned the famous “Probleme des 36 
Officiers” that was first stated by Euler. Like Tar- 
quin, Euler devoted much effort to solving this 
problem, but was also unable to find a solution. 
Euler had already developed techniques to con- 
struct pairs of Orthogonal Latin Squares of order 
n for n odd or divisible by 4. Given the (trivial) 
impossibility of the case n = 2 and his unsuccess- 
ful attempts for n = 6, Euler made a bold claim: 
There do not exist a pair of Orthogonal Latin 
Squares of order 4t + 2, for any t > 0. One can 
only imagine Euler’s reasoning for making such 
an extreme claim on the back of two pieces of evi- 
dence; I daresay it smacks of Physicists’ Induction. 


A first cry of success came in 1900, when GasTON 
Tarry published a proof that the case n = 6 is in- 


deed impossible. Tarry was born in France and 
moved to Algeria to work as an administrator. Al- 
though an amateur, he had an amazing capacity 
for combinatorial problems. His proof proceeds 
as follows: he began by reducing all order 6 Lat- 
in Squares to 17 types, via careful and painfully 
detailed reasoning about cycle types in S;. From 
there, he reasoned that for a solution to exist, 
we must find such a Latin Square that possesses 
a complete set of transversals. A transversal is a 
subset of n cells within an nxn square, labelled 
with the symbols 1 to n, such that in each row and 
column of the square there is a cell of the subset. 
At that point, he took a minor detour to give an in- 
teresting study of the n = 4 case using transversals. 
Returning to the proof, he managed to reduce to 3 
the number of Latin Squares that need to be con- 
sidered by showing that some cells of some Latin 
Squares cannot be part of any transversal. Finally, 
he considered these three cases in turn, using a 
greedy algorithm which he calls The Method of 
Order to show that no orthogonal mate can exist 
for any of them. While we can be glad that this 
proof resolves the problem, it does lack a certain 
panache; it tells us that no orthogonal mate can 
exist, but doesn't tell us why it can't exist. Indeed, 
even Tarry was disappointed by this; he writes, 
“The method of order, which does not shed any 
light on the problems it resolves, should not be 
used unless we cannot do otherwise; it is a last 
resort’. 


Since Tarry’s proof was published, shorter and 
more informative proofs have been found. See, for 
example, FISHER AND YATES (1934), YAMAMOTO 
(1954) or STINSON (1984). Stinson’s proof rather 
interestingly highlights a link between Latin 
Squares and coding theory! 


The Fall of Euler’s Conjecture 


Following the proof for the case n = 6, the subject 
seemed to have died down slightly. This changed 
in May 1959, when Raj BosE and SHARADCHAN- 
DRA SHRIKHANDE managed to construct a pair 
of Orthogonal Latin Squares of order 22, thus 
disproving Euler’s Conjecture. In fact, it didn't 
stop there; shortly after that paper was published, 
ERNEST PARKER published a paper in which he 
presented an example with n = 10. Euler’s Conjec- 
ture was crumbling fast; in fact, he couldn't have 
been more wrong. In one final paper published 
by Bosz AND SHRIKHANDE in 1960, they proved 
that there exist a pair of OLSs of order 4¢ + 2 for 


infinitely many t. To add insult to injury, the proof 
didn’t even use very advanced techniques; it relied 
on fairly straightforward properties of finite fields 
and techniques from Combinatorial Design theory. 


The sudden success with which Euler’s Conjec- 
ture was disproved sparked a new wave of inter- 
est in Latin Squares. To capitalise on this inter- 
est, JOSZEF DENES and DONALD KEEDWELL wrote 
a comprehensive volume on Latin Squares in 
1974. It became an instant hit; such was the in- 
terest in Combinatorial Designs that a sequel was 
published 17 years later. Between them, the two 
volumes cover many aspects on the theory and 
applications of Latin Squares and contain no less 
than 4 complete chapters devoted to the idea of 
Orthogonality. Applications included Experimen- 
tal design, Statistics, Error-correcting codes, Al- 
gebra and Geometry, to name only a few. 
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Archimedeans Annual 


Problems 


1 Dazzling Dice 


NoLUI ce) l eM (ieclaretel com reli mipcesi(e(ere MA iuaMi(e(=tomm Inco) Rel (=M-VaMl avilayincy 
number of times, recording the total score attained so far after each 
roll as a sequence. What are the most and least likely numbers to 
appear in this sequence, and with what probabilities do they occur? 


2 Snappy Surds 
Find all integers such that 


n—4/n-19 = neg aha Prana Fate : . 


is also an integer. * vy A Nat ai 
iq ij 7 


967,535 
999,917 
999,919 1,093,411 


4 Compelling Convergence 


Determine whether the following series converge or diverge, and determine the value of 
any that converge. 


We 2°. 2sinn 2(1— sin’ n) se 


© |sinn (log 2)?"*! 
ea e, En 


2 
qouleat FS RG 1—cos4n Se itar dl 


| 24 | ha non- trivial integer n with the property that 
at ie isa perfect square, in this case 707. =. = 
. + sz — . 


5 Superb Sets 


For each of the following sets, determine whether it is finite, count- 
able or uncountable. Give the explicit sizes of the finite sets, and for 
any uncountable set, determine whether it bijects with R. 


* Group homomorphisms (Z,+) — (Q,+) 

* Group homomorphisms (Q,+) — (Z,+) 

e Equivalence relations on Q 

¢ Sequences in Q that converge to some member of Q 


AY, 


6 Triumphant Treasures 


The planet Zog has radius 1 has an associated geostationary moon of 
negligible radius. You have followed the evil space-pirate Blackmous- 


tache to this system, in which he has buried his treasure. You know that: in 

i. The moon lies a distance A from the planet, with 1 <A < 1.5; = 

ii. |The centre of Zog is denoted Z; 

iii. The city of Luna lies on the closest point of the planet to the moon; " 

INVA Mat=mol ig yao) my -ValulLU lar Miw-laua] execel-1 (cl md al-Meli alla lave Moya al-el-]aa-it=19) 
icon MUlal-} 

v. The treasure lies at least A away from Antiluna; ? 

vi. The city of Luna produces so much toxic waste that any point Pin | 
the planet with <PZL < a cannot contain the treasure; | 

vii. The core of Zog is molten, so the treasure does not lie within it; —— 


viii. If the core has volume V and surface area A, then 
A M4 Zhi the 
Fee a ay eaten EU = Ie 


ix. If the treasure lies at the point T, then the following inequality 

holds: |T'Z| sin (4 TZL) < \/(A2 =1) sin’ a + 4 sin?(2e) - } sin(2) ; —— 
x. The city of Midi lies exactly halfway between Luna and Antiluna, 

with antipodal city Centra. Then the treasure is known to be in the 

felt-lat-meo)aie-}ialiavem MUlar-PmlUicelm-laremavalalllat-mr-]ale mon ol-W-)m (-t-hiar-15 

close to Midi as to Centra. 


0 8, O82) 


Where is the treasure buried? 


e e 
7 Curious Coins 
MAVon o)-)\(=1 a0 ©) [-hyar-Mef-lan\-Ke)al-lalu am q@ancve [0l-1x-mic-] 6) (-me)amval(eamee)|alKe)ime|-lan(-i\-1an Mem o)-La-te| 
in turn. The winner is the one who plays the last coin. For which n do you want to play first? 


NB: The coins must have their centre above the table, must be placed flat and cannot be stacked. 


Smallest pseudo prime satisfying 7” = 7 mod n. 
Automorphic, aspiring and non-sociable number 


Airey . 


iy. Peavy (a) 
7: iPerceptive Polygons am 


istetollaMaNaneclanste [Ul clecle-lmeat-lave](Mormicol-M (cael ua Mm ir-lave ol e-\ay 
eel coal Ulanvelicol(-wVoLol Ul mani onell ColUlpalsveld] oX-- Mo [Ur] c-mr-laremdatcia 
draw the circle around this. Repeat this infinitely many 
id[aatexouar-(@ am diaatemelcelUlaatxeldle)iatem-mcaxeUl(-laanse(elam-lcellavem uals 
ColUicclanatosimel| col [-mr-latemaalclamelca\iilaremaat-mell cele lanvel| col(-Meyim uar-ye 
forming the new outermost circle. Does this object fit inside 
EMell cel (emo) me-le Lum ke LOK 


T > ey Hey | * 


ae mh | ) fy 

ees a: ie eae | a - —_ _ a = 
e sul ate it ele 3s 
— lengt! iderdroppingaper- H 
as see te aR 2 re ‘ 10 Rough Relations 
-_Spposite side. Then, repeat this pro- 
WEES s, spiralling in clockwise, as in Hae A; 
picture. Where (in Cartesian coordi- 
nates, calling the bottom left vertex 
the origin) is the point to which this 
process converges? 


Let R be a relation that is “anti-transi- 
tive”, that is if dRb and bRc, then cRa. 
Then, define f(n), for n EN, as the least 
m €N such that there exists a set T, 
with |T| =n and a,b €T= aRb or bRa, 
so that exactly m unordered pairs 

(s, t) € Tx Thave the properties: 


i. 
ii. sRt and tRs. 
Find, for all n € N, the value of f(n). 


Sesie 


11 Gorgeous Geometry 


Let C be the mid-point of OD, and let Q lie on the semicircle through D 
with centre C, whose diameter is perpendicular to OD. Points A and B lie 
Tad au o)f-lal-me) mdal-mX) nal (ell col [-mur-lc-M=xe [Ul(e(cir-lamice)aomOn-lavenr-| Xonice)anm OFM Nal) 
point R completes the rhombus QARB. 


Find the locus of R as Q traverses the semicircle, with the distances OA, OB, 
(o)sAy-Vre-lateW sJamctant-lialiare Mid <ae 


26 Only positive number to be directly between a 
square and a cube. Number of sporadic groups. 


12 Mysterious Matchings 


This problem can be solved easily by 5 to 10 year olds: 


12350 8738 — 4 0000 — 4 8317 > 2 
11011 3275-0 9834 4 2814 — 3 


2222-0 21761 9393 > 2 5656 — 2 3821 >? = 
3535-0 8261 — 3 7272-0 0909 — 4 
4 9232-1 7068 — 4 1818 ~4 77770 


13 Dazzling Digits 


Hardy and Ramanujan are playing a game, where on each turn, Hardy 
rarelant=cmxolanr=ael feria audalcolamata\=XoN aXolal ol-Mel Kid) areia ico)pa e)¢-\Ulelucmel(elie) mr-lare| 
then Ramanujan inserts it into the expression +**** — ***x, in place of 
one of the stars. 


Hardy is aiming to maximise the value of the expression, Ramanujan to 
FaallalianlKxMieM s{-11aXe Mm Gr-lanle)atelel-Mantclantlanrcidcelclakymdal-\’m olend al o)[-\a e\-larcveld \VA 
What is the value of the expression? 


Across 

3 Commuter (BA) pouring ale 
mixture. (7,5) 

a Tolmtlarel(-melmarendaliare ja (o)) 

5 With an angle that small, 
there’s no degrees here. (7) 

7 Rain men confused, but 
ial Mel kxee\ aleve mel licciccral del 
geometry. (7) 

8 One bash in frontless city — it 
fixes everything. (8) 

9 Three different sides of disc — 
ale needed. (7) 


Down 

ss =e [Url Mata molarmaelale-lialiare| 
InCclaallixelakcme] cole) om Gp) 

2 Mix paint on hide of Archi- 
medes’ cows. (11) 

3 That’s sum royal snake! (5) 

6 Also intersection. (3) 


Solutions to the Archimedeans Problems Drive can be found on page 94. 


In the Collaz Conjecture you need 112 steps to 


get from 27 to 1. Sum of the digits of its cube. 
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Pi in Fours 


John Conway and Michael Guy 


First published in issue 25, 1962 


r | “Vhe famous “four 4s problem” asks you to ar- 
range four 4’s, and any number of the ordi- 
nary mathematical symbols, to give as good 

an approximation to Pi as you can find. 


We shall allow the symbols (, ), +, -, x and +, the 
usual notations for roots ¥ and *V, powers, factori- 
als and the decimal notation 44, .4 and .4. Pi itself, 
logarithms and trigonometric functions may not 
be used. Factorials are to be of integers only, oth- 


erwise 7 = \/ (-V/4/4)!4. We shall also not allow 


such monstrosities as AVA. 


For example, 


4+ 4\*" 
a) 


is a very good approximation to e, and can clearly 
be modified to be as good as we please. It can fur- 
thermore be improved so as to only use three 4’s, 
since, as n > oo, n/ Vn! > e. 


We may derive similar “explicit” formulae for var- 
ious interesting numbers. Thus n¥/a — n > log a, 
so that we obtain a sequence of approximations 
to log 2, log 5, and logab for a variety of rational 
aand b (e.g. log 92 or log 93). Our best result of 
this kind for z has seven 4’s, and is derived from 


2" n! ; 
= lim | ———__] . 
=e (Tae vem) 


Second perfect number. Sum of the 
totient functions of the first nine integers. 


We can also find log in seven 4’s, but as yet we 
have not been able to find any formula of this kind 
for Euler’s constant y. 


We shall now show that the above devices are un- 
necessary. In fact: 


Theorem 1 Any real number may be approxi- 
mated arbitrarily closely using only four 4’s 
and the usual symbols. 


Proof: It follows from the formula n (Ya -Vb ) > 
log(a/b) that for sufficiently large n we have 


amon (ge nh? ge) 
< gmt 


for the limit of this expression as n — oo is 
2™log 4, and 1 < log 4 < 2. If now m is any integer 
and n > m, both n - m and n - m — 1 are posi- 
tive, so that we may write the expression above as 
2"/"™"4 - ,/" 4), the indices of the root 
signs indicating repetitions. Taking square roots 
k times, we have 


aml <P (Jae (4-4) 


2 m4) /2* 


Now we may take n to be of the form 4(!)’ so as 
to satisfy all the above conditions, and the the 
expression between the inequality signs will use 
only four 4’s. Since the numbers 2””?" for integers 
m and positive integers k are dense in the positive 
real numbers, we have proved our theorem. (For 
a negative number we need merely add another - 
sign.) 


Theorem 2 If we allow use of the integer part 
sign, every integer is representable with four 
fours and every rational number with five. 


The first part is obvious, and the second part be- 
comes a corollary of the first when we note that 
any rational p/q equals m/4(!)” for suitable inte- 
gers mand n. 


We may modify Theorems 1 and 2 so as to use 
other (positive integral) numbers instead of 4’s. 
The only condition is that at most three of these 
may be 1’s. 


Finally we pose these questions: 


e Is there an “explicit” formula for 7 
with less than seven 4's? 


e Is there and explicit formula for y? 
« Are the numbers \/"(4(!)”) dense in x > 1? 


Tetranacci number. Saturn i 
@ requires 29 years to orbit the Sun. 


BOWE 


g 
. 


30 


a4 
proved the 1904 


Bailie in 2003, he gave a 
h proof of Thurston's stronger ge- 
ometrisation conjecture, which had been around 
since 1982. Both concern 3-manifolds. But what 
about the geometrisation theorem’ little sister, 
which deals with 2-manifolds? 


Known as the classification theorem for closed 
connected surfaces, this gives a complete list of all 
closed connected 2-manifolds up to homeomor- 
phism, enabling any such surface to be slotted 
into one of two simple categories. 


First announced in 1888, with a proof that turned 
out to be incomplete, the classification theorem 
was proved in 1907, albeit assuming triangulabil- 
ity, which was only proved in 1925. 


Nowadays, it crops up in various second and 
third-year Tripos courses, including IB Geometry, 
II Differential Geometry, Il Algebraic Topology. 
But only stated, not proved. 


There is an elementary proof, requiring little more 
than a basic knowledge of triangulations. This was 
first given by Christopher Zeeman in the 1960s. 


Definitions 

A surface, or 2-manifold, is a topological space 
that’s locally homeomorphic to R”. In other words, 
any point has a small neighbourhood which is ap- 
proximately flat. Take for example the surface of 
the Earth: from close up it looks flat, and you need 
to get a long way away to see that it isn't. 


Icosahedron and dodecahedron have 30 edges. 
The icosidodecahedron has 30 vertices. 


j 


Surface Differences _~ 
SY 


A surface is connected if it’s all in one piece; and 
closed if it has no boundary and can be expressed 
as a finite union of discs. So a cylinder isn’t closed, 
and nor is any unbounded surface in 3D, but a 
sphere and torus are. For brevity, we'll use ‘surface’ 
to mean ‘closed connected surface’ 


Two surfaces are homeomorphic if there is a con- 
tinuous bijection between them with a continu- 
ous inverse: intuitively, if they are ‘topologically 
equivalent’ in the doughnut-teacup sense. 


We assume all surfaces are triangulable: in other 
words, that any surface is topologically equivalent 
to a polyhedron with flat triangular faces. 


The Euler characteristic is the quantity y= V- E+ 
F, where V, E, F are the numbers of vertices, edges, 
and faces of the triangulation. This is an invariant: 
any two triangulations of the same surface have 
the same Euler characteristic. 


And off we go... 


The Theorem and Proof 


The classification theorem states that any closed 
connected surface S is homeomorphic to one of 
the following: 


« if it’s orientable, the sphere with g handles 
glued on, i.e. the g-holed torus, for some g = 0; 

« if it’s non-orientable, the sphere with h M6- 
bius bands sewn in, for some h 2 1. 


Gluing on a handle means removing two small 
discs on the sphere and sticking the two edges of 


We'll need the foll owi g Pit 


. { 

of: Note that the Euler characteristic of a 
s x = V - E, since a graph has no faces. If 
is a tree, i.e. it has no closed loops, then 
t can be shrunk to a point, so its Euler charac- 
teristic is 1. If it isn’t a tree, then removing one 
edge from a closed loop increases the Euler char- 
acteristic by 1, and we can get a tree after finitely 
many such operations. So for a graph we always 
gety<l. 


__ surface is at m 


Take a triangulation T of a surface S and consider 
its dual triangulation D, formed by putting a ver- 
tex at the centre of each T-face and a face with 
centre at each T-vertex. Let M be a maximal tree 
in D, defined as a tree to which no more edges can 
be added without creating a closed loop, and let 
C=D\M. 


Since M is a tree, C is connected. Since M is maxi- 
mal, M contains all vertices of D. So there are bi- 
jections 


{T-triangles} <> {M-vertices}, 
{C-edges} <> {D-edges}, 
{C-vertices} <> {C-vertices}. 


Therefore y(S) = x(M) + y(C) < 2. 


Lemma 2 If Sis a surface which is discon- 
nected by every closed curve on it, then it is 
homeomorphic to the sphere. 


Proof: Let T, D, M, and C be as before. If C con- 
tains a loop, then this loop disconnects S; each 
connected component must contain a D-vertex, 
and any two D-vertices are joined by edges in M. 
Contradiction, so C is a tree. Let X be the set of 
points in S closer to M than to C, and Y be the 
set of points in S closer to C than to M. Each of X 
and Y is a fattening up of a tree, so they are both 
homeomorphic to the disc. But S is just X and Y 
glued together edge-to-edge, so S is homeomor- 
phic to two discs glued edge-to-edge, i.e. to the 
sphere. 


We now use the following surgery algorithm on 


an arbitrary surface S. 


1. If Sis disconnected by every closed curve 


on it, stop. 


If there is a non-disconnecting closed 
curve on S, remove a thin strip around this 
curve; this strip must be a cylinder or a 
Mobius band. 


If the strip is a cylinder, glue in two discs 
to the gaps left in S, increasing y(S) by 2, 
and mark both of them with an orientation 
(clockwise or counterclockwise) so that 
they agree along the cylinder. 


If the strip is a Mébius band glue in 1 disc 
to the gap left in S, increasing y(S) by 1. 


5. Gotol. 


By Lemma 1, the process stops after finitely many 
steps. By Lemma 2, the surface we get when it 
does stop must be a sphere. 


Now start from a sphere and reverse the process 
to get to S in finitely many steps. In each step, we 
have three possibilities for what needs to be re- 
placed: 


a. 1 disc; 

b. 2 discs with different orientation (one 
clockwise, one counterclockwise); 

c. 2 discs with the same orientation (both 


clockwise or both counterclockwise). 


If it is (a), we're sewing in one Mobius band. If it 
is (b), were gluing in a handle. If it is (c), we're 
sewing in a Klein bottle. But a Klein bottle is just 
two Mobius bands, so we can ignore (c) without 
loss of generality. 


So we can get to any surface S by starting with a 
sphere and putting in finitely many Mobius bands 
and handles. 


If Sis orientable, then it can’t contain any Mobius 
strips, so it’s a sphere with finitely many handles. 


If it is non-orientable, then we must sew in at least 
one Mobius band. If we ever glue in a handle, then 
we can transport one of the two differently-orient- 
ed discs around this Mébius band so that they’ve 
both got the same orientation. This reduces (b) 
to c), which we've seen reduces to (a). So S is a 
sphere with finitely many Mobius bands. And this 
completes the proof. 


Number of musical triads (12 major, 12 minor, 4 
diminished, and 3 augmented). 11111 in base 2. 
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First published in issue 57, 2005 


ne of the most often cited applications 
@ of the study of fractals is that of their 

use in image compression. Such an 
application is not surprising, since seemingly 
complicated and intricate fractal images have 
relatively simple mathematical descriptions 
in terms of iterated mappings. Given also that 
fractals have been found to model well a wide 
variety of natural forms, it seems natural that 
we should try to exploit their self-similar prop- 
erties to encode images of such forms. 


We examine a simple example of a fractal, the 
Koch curve, to illustrate the principle of encod- 
ing a fractal image. Referring to Figure 2, we 
construct the Koch curve by first taking a line 
segment of length 1, Ky. We then construct K, 


Figure 1 Real and 
computer generated Ferns 


Freezing point of water at sea level in Fahrenheit. . 
Ninth Happy Humber. 1.4274 33= 32. § 


by combining the images of this segment under 
four transformations, each involving a dilation 
of factor 1/3 composed with either or both a ro- 
tation and translation. Combining the images of 
K, under the same four transformations yields 
K,, and the Koch curve itself (K,,) is the limit 
of this process as it is iterated. (Peitgen, in [4], 
calls this method of drawing fractals the “Mul- 
tiple Reduction Copy Machine” or MRCM.) 


We can see that this object is self-similar, in the 
sense that we can find arbitrarily small por- 
tions of the curve that are related to the whole 
by a similarity transformation. 


The fractal fern of Figure 1 is also the limit 
of four affine transformations iterated in the 
same manner. Since each affine transformation 
may be represented by a 2x2 matrix giving the 
homogenous part of the transformation anda 
2-component vector giving the inhomogeneous 
(translation) part of the transformation, a fig- 
ure that is the limit of n iterated affine transfor- 
mations can be encoded as a collection of 6n 
real numbers — a much more efficient encod- 
ing than a pixel-by-pixel representation. These 
ideas also generalise in an obvious manner to 
subset of higher dimensional Euclidean space. 


We might then ask whether we can measure 
how ‘close’ a perfectly self-similar or self-affine 
fractal is to a given ‘imperfect’ real life image 
that we are trying to approximate. We might 
also ask how many iterations of the kind de- 
scribed above we need to carry out to get a rea- 


sonable approximation of the limiting set. More 
theoretically, we might question whether we can 
be sure that such iterations will indeed tend to a 
definite limit, and, given that any such limit will 
be invariant under the iteration, whether it matters 
with what set we start. Could we, for example, have 
begun our construction of the Koch curve with a 
circle rather than a line segment? 


[im Figure 2 Construction of the Koch curve 


In this article, we shall see that, by considering sub- 
sets of Euclidean space as points in a metric space, 
we can measure how different two images are, and 
by applying the contraction mapping theorem, we 
can see that limit sets of the sort described above 
do exist, that our starting point in their construc- 
tion does not matter, and we can also obtain an 
estimate for how rapid the convergence is. 


r 


| Definitions Definition 4 Let (¥,d) be a metric space. ies 
Then f: X > & isa contraction if there ex- 


ists a non-negative real number c < 1 such 


that d(f(x), f(y) < cx d(x,y) for all x,y € X. 


For reference, we enumerate here a few stand- 
ard definitions and theorems that we shall use 
later. 


= Our central theorem tells us about the be- 
Definition 1 A metric space is an ordered 


ra haviour of contractions under iteration (for a 
e pair (¥,d), where Visasetandd:X¥xX—>R proof, see, for example, [3]). 
is a function with the following properties: 
() d(xy)20 VayeX, Theorem 5: Contraction Mapping 
J with d(x,y) = 0 ifand only ifx = y; Let (4d) be a non-empty complete 


a B : metric space and f: Y > X a contraction. 
tii) GCC) 5 DIN ey eX: Then there exists a unique x) € Y such 
(iii) d(x, z) <d(~y) + d(y,z) V xy,ze X. that f(x9) =x, and furthermore, 


Isto wolf (62) Sa% Kor alll see Ze 
The notion of convergence of a sequence to a nsoof"() = x0 


limit carries over to metric spaces in an obvi- 


In the final section we will refer to a corollary: 
ous way, as does the following related notion: 


Corollary 6 Let (4d) be a non-empty 
complete metric space and f: ¥ > X¥ such 
that f” is a contraction. Then the same 
conclusions hold as for Theorem 5. 


Definition 2 Let (x,,) be a sequence of points 
in a metric space (4, d). We say that (x,,) is 
Cauchy if, given e > 0, there exists an N¢ N 
such that for all m,n =N, d(Xp)X,) <€. 


For most of the time, we shall restrict our at- 


Clearly every convergent sequence is a Cauchy tention to compact subsets of metric spaces. 


sequence. The converse is also true for an im- 
portant class of metric spaces: Definition 7 Let (X,d) be a metric space. 
Then we say A C ¥ is compact if every 


Definition 3 A metric space (4, d) is covering of A by open sets has a finite 


complete if every Cauchy sequence in 


subcovering. 
converges. 
The important properties of compact sets 
i's f We remark that the metric space formed by R" which we need are that they are closed and 
1 with the usual Euclidean metric is complete. bounded. 


4 Large t integer that isn’t the sum of distinct 
angular numbers. Sum of the first four factorials. 


Hausdorff Distance 


Our starting point is a way of turning a collec- 
tion of subsets of Euclidean space into a com- 
plete metric space, so that we can talk about 
limits and convergence, and make use of the 
considerable information provided by the con- 
traction mapping theorem. The concept we 
require is due to Hausdorff, who formulated 
a notion of ‘distance’ between compact sub- 
sets of a metric space which makes the set of 
| compact subsets of a given metric space into a 
sate metric space itself. Furthermore, if our initial 
‘| metric space is complete, then so is the space 
of compact subsets with the Hausdorff metric. 


We require a further concept before introduc- 
ing the Hausdorff distance itself: 


Definition 8 Let A bea subset of a metric 
space (¥,d). The e-collar of A, denoted A,, 
is the set {xe V: Ja€ A with d(a,x) <}, 
ice. the set of all points at a distance at most 
é from the set A. 


Definition 9 Let A and B be compact 
subsets of a metric space (4, d). If we write 
p'(A,B) =inf{e>0:A ¢ B,} then the 
Hausdorff distance p(A, B) between A and 
B, is defined by p(A, B) = max{p'(A,B), 

f = p-(B, A)}. 


It follows straightforwardly from the definition 
that p’ satisfies all the axioms for a metric space 
in definition 1 except (ii), so the final part of 
the definition is essentially a symmetrisation. 
An alternative definition sometimes used (for 
example in [3]) but which does the same job is 
p(A, B) =p’ (A,B) + p'(B, A). The proof that the 
resulting metric space inherits completeness is 
given in [2] and as an exercise in [3]. 


i's 
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The Hutchinson Operator 


Now that we have some way of measuring 
‘closeness’ of compact subsets of metric spaces, 
our next task is to show that the iterated trans- 
formation applied in Figure 2 to construct the 
Koch curve is indeed a contraction, so that we 
may apply Theorem 5. The following treatment 
follows quite closely that of [4]. We work in R™. 


We have a collection of affine transformations, § 
T,, T» ...» T,» and at each iteration we apply the 
transformation ; 


(This is known as the Hutchinson operator, 

after Hutchinson who first analysed its prop- ff 
erties.) We impose the condition that each T; ‘ 
should itself be a contraction with respect to 

the Euclidean metric, with constant c; < 1. " 


We now show that T is a contraction with 

constant c = max{c,, Cc, .... c,} on the met- 

ric space of compact subsets of R” equipped ' 
with the Hausdorff metric. (See diagram below 

for the following.) Let A and B be compact 

subsets of IR” with p’(B, A) = 6. Then for any 

e > 6 we have B ¢ A,. Clearly then T;B ¢ T;A,, 

for each i, but since T; is contractive on R”, 

TA, € (Ty A)e, where €; = cje< ce. Hence T;BC ff 
(T;A)e, & (T,A)ce> yielding f ‘ 


U T;BC U (TiA) ce = (U na] 
i=l i=) i=] 


cé 


So TBC (TA),, for all e> 6, and hence p’(TB, TA) 
< cd. Therefore p(TB,TA) < c x p(A,B) and so 
T is indeed a contraction. 


An important practical observation which can 
be made from the above proof is that the con- 
traction constant calculated for T is equal to 
the largest of the individual contraction con- 
stants of the transformation T,. It is clear from 
the proof that, in general, we can do no better 
than this. In the usual proof of the contraction 
mapping theorem, it is shown that, for a con- 
traction f with constant c, 


(f(x), F(x) < d(x, Fx) 


Since this inequality holds for all k, the expres- 
sion c"/(1 — c) provides an estimate for how 
quickly the iterations converge to the unique 
fixed point. As might be expected, we see that 
the larger the constant c, the slower the con- 
vergence. Hence the MRCM method of draw- 
ing fractals is only as rapid as is allowed by the 
‘least contractive’ contraction. It is, however, 
worth remarking that a given transformation 
may or may not be contractive, depending on 
the choice of metric, and that the contraction 
constants will vary according to the metric 
used. Since the notion of Hausdorff distance 


works for any metric space, not just IR” with 
the Euclidean metric, we may certainly replace 
the Euclidean metric in the above analysis with 
any other making R” into a complete metric 
space, to be able to draw conclusions about the 
convergence properties of a wider variety of 
Hutchinson operators. 


Julia Sets 


We conclude with some brief, informal re- 
marks about how these ideas may be applied 
to producing images of another rather famous 
class of fractals. For a given polynomial f: C > 
C, the Julia Set of f J(f), is the closure of the 
set of repelling (unstable) fixed and periodic 
points of f. This is non-trivially equivalent to 
the definition as the boundary of the basins of 
attraction of the attractive fixed points of f (for 
details see [1]), and the set J(f) has the prop- 
erty that f(J) = f"(V) = J. The most famous 
example of these objects are those associated 
with the mapping f: z+ z? + c for various 
c € C (like the example shown in Figure 3). In 
this case we notice that the inverse mapping 


Numberof hexominoes, the polyominoes 
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flize {/z—c} seems to play the role ofa _ repelling periodic point of f (whose period we 
non-linear Hutchinson operator, in that each _ shall denote by p), which in ruen will be an 
point (other than c itself) has two images, and _ attractive periodic point of f;° : z> +/z—c 
the fractal of interest is invariant under the and f,' : z+ —/z—c. Hence the iterate T?  [P * 


wie transformation. of the Hutchinson operator T defined by these Co 
aL two mappings will be a local contraction, and hy 

s. aaj We might well then ask whether the mappingis _ so Corollary 6 suggests that, at least if we con- co 
tr contractive. Here a partial answer is suggested _ sider sets not ‘too far’ in terms of Hausdorff "aha 


tells us that for a conformal mappingg:C +C, in the same manner as for the self-affine frac- 

the approximate scaling in length near a point __ tals discussed above. In fact the convergence is 

Zo in C is |g’(zo)|. The criterion fora fixed point —_ very good, and although after a finite time the -— 

Z of a mapping g to be attractive, viz. |g'(z))|< iterates do not in general approximate all parts ; 

fis "oa j 1, is therefore the same as the criterion for the __ of the Julia set evenly, this is how many fractal 
_+ = .3] mapping to be locally contractive. Any point _ packages produce their images of Julia Sets. 
~*] close to J(f) is, by definition, close to some 


rit 
by the theory of conformal mappings, which _ distance from J(f), the iteration will converge — Ci » 
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36 Smallest non-trivial square-triangular number. 


The sum of the first 36 integers is 666. 


Human body temperature in degrees Celsius. 
Number of plays by Shakespeare. 


Stein’s Paradox 


Dr Richard J. Samworth, Statslab Cambridge 


tics arises in a remarkably simple estima- 

tion problem. Let X, .... X, be independent 
random variables, with X; ~ N(6;, 1) for i = 1, 
p. Writing X = (X,,..., Xp)", suppose we want 40. 
find a good estimator 6 - 6(X) of @ = (0,,... Ae 
To define more precisely what is meant by a ‘good 
estimator, we use the language of statistical deci- 
sion theory. We introduce a loss function L(6, 0), 
which measures the loss incurred when the true 
value of our unknown parameter is 0, and we esti- 
mate it by 8. We will be particularly interested i - 
the squared error loss function L(6, 0) = 
where || - || denotes the Euclidean norm, but einer 
choices, such as the absolute error loss L(6, 0) = 

? ,|6, - 0,| are of course perfectly possible. 


P erhaps the most surprising result in Statis- 


Now L(6, 6) is a random quantity, which is not 
ideal for comparing the overall performance of 
two different estimators (as opposed to the loss- 
es they each incur on a particular data set). We 
therefore introduce the risk function 


R(6, 0) = E{L(6, 0)}. 


If @ and 6 are both estimators of 6, we say 6 
strictly dominates 6 if R(6,0) < R(6,0) for all 6, 
with strict inequality for some value of @. In this 
case, we say 6 is inadmissible. If 6 is not strictly 
dominated by any estimator of 0, it is said to be 
admissible. Notice that admissible estimators 
are not necessarily sensible: for instance, in our 


CS 


Largest even number which cannot be written 
as the sum of two odd composite numbers. 


problem above with p = 1 and the squared error 
loss function, the estimator 6 = 37 (which ignores 
the data!) is admissible. On the other hand, deci- 
sion theory dictates that inadmissible estimators 
can be discarded, and that we should restrict our 
choice of estimator to the set of admissible ones. 


This discussion may seem like overkill in this 
simple problem, because there is a very obvious 
estimator of @: since all the components of X are 
independent, and E(X,) = 0; (in other words X; 
is an unbiased estimator of 6;), why not just use 
6°(X) = X? Indeed, this estimator appears to have 
several desirable properties (for example, it is the 
maximum likelihood estimator and the uniform 
minimum variance unbiased estimator), and by 
the early 1950's, three proofs had emerged to show 
that 6° is admissible for squared error loss when 
p = 1. Nevertheless, STErN (1956) stunned the sta- 
tistical world when he proved that, although 6° is 
admissible for squared error loss when p = 2, it is 
inadmissible when p > 3. In fact, JAMES AND STEIN 
(1961) showed that the estimator 


ea] 
xX 


strictly dominates 6°. The proof of this remark- 
able fact is relatively straightforward, and is given 
in the Appendix. 


A/S (x) -(1- 


One of the things that is so surprising about this 
result is that even though all of the components 
of X are independent, the ith component of 6/S 
depends on all of the components of X. To give 
an unusual example to emphasise the point, sup- 
pose that we were interested in estimating the 
proportion of the US electorate who will vote for 
Barack Obama, the proportion of babies born in 
China that are girls and the proportion of Britons 
with light-coloured eyes. Then our James-—Stein 
estimate of the proportion of democratic voters 
depends on our hospital and eye colour data! 
The reader might reasonably complain that in 
the above examples, the data would be binomi- 
ally rather than normally distributed. However, 
one can easily transform binomially distributed 
data so that it is well approximated by a normal 
distribution with unit variance (see the baseball 
example below), and then consider the estimation 
problem on the transformed scale, before apply- 
ing the inverse transform. 


Geometrically, the James-Stein estimator shrinks 
each component of X towards the origin, and it 
is therefore not particularly surprising that the 
biggest improvement in risk over 6° comes when 
||@|| is close to zero; see Figure 1 for plots of the 
risk functions of 6° and 6/8 when p = 5. A simple 
calculation shows that R(6/5, 0) = 2 for all p > 2, 
so the improvement in risk can be substantial 
when p is moderate or large. In terms of choosing 
a point to shrink towards, though, there is noth- 
ing special about the origin, and we could equally 
well shrink towards any pre-chosen @) € R? using 
the estimator 


p-2 
fae) ae 


In this case, we have R(Of, 0-6,) = R(O, 8), so 
6% still strictly dominates 6° when p > 3. 


01° (X) = 65+ (1 


Note that the shrinkage factor in GF becomes 
negative when ||X - 6,|* < p - 2, and indeed it 
can be proved that 6/} is strictly dominated by the 
positive-part James—Stein estimator 


R = 2, 
EPR. <etem | oee xX : 
| ) 0 ( |X — || S 9) 
eri f aS 
F re 
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where x, = max(x, 0). The risk of the positive- 
part James-Stein estimator 6 ig = 6 6 is also in- 
cluded in Figure 1 for comparison. Remarkably, 
even the positive-part James-Stein estimator is 
inadmissible, though it cannot be improved by 
much, and it took until SHAo AND STRAWDERMAN 
(1994) to find a (still inadmissible!) estimator to 
strictly dominate it. 


Generalisations and 
Related Problems 


It is natural to ask how crucial the normality and 
squared error loss assumptions are to the Stein 
phenomenon. As a consequence of many papers 
written since Stein's original masterpiece, it is now 
known that the normality assumption is not criti- 
cal at all; similar (but more complicated) results 
can be proved for very wide classes of distribu- 
tions. The original result can also be generalised 
to different loss functions, but there is an im- 
portant caveat here: the Stein phenomenon only 
holds when we are interested in simultaneous es- 
timation of all components of @. If our loss func- 
tion were L(6, 0) = (6; - 0,)?, for example, then 
we could not improve on 6°. This explains why it 
wouldnt make much sense to use the James—Stein 
estimator in our bizarre example above; it is in- 
conceivable that we would be simultaneously in- 
terested in three such different quantities to the 
extent that we would want to incorporate all three 
estimation errors into our loss function. 


Although Stein's result is very clean to state and 
prove, it may seem somewhat removed from 
practical statistical problems. Nevertheless, the 
idea at the heart of Stein’s proposal, namely that 
of employing shrinkage to reduce variance (at 
the expense of introducing bias) turns out to be 
a very powerful one that has had a huge impact 
on statistical methodology. In particular, many 
modern statistical models may involve thousands 
or even millions of parameters (e.g. in microar- 
ray experiments in genetics, or fMRI studies in 
neuroimaging); in such circumstances, we would 
almost certainly want estimators to set some of 
the parameters to zero, not only to improve per- 
formance but also to ensure the interpretability of 
the fitted model. 
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Sum of five consecutive primes (3 +5 +7 +11 + 13) | 39 —= 


and the first three powers of three (3 + 9 + 27) 
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A Figure 1: Risks with respect to squared error loss of 
the usual estimator 9°, the James-Stein estimator &5 and 
the positive-part James-Stein estimator 6/6 when p =5. 


Another important problem that is closely related 
to estimation is that of constructing a confidence 
set for 0, the aim being to give an idea of the un- 
certainty in our estimate of 0. Given a € (0,1), 
an exact (1 - «)-level confidence set is a subset 
C = C(X) of R? such that, whatever the true value 
of 6, the confidence set contains it with probability 
exactly 1 - a. The usual, exact (1 - a)-level confi- 
dence set for 6 in our original normal distribution 
set-up is a sphere centred at X. More precisely, it is 


C°(X) = {9 ER? |9- XI? < x5 (a)}, 


ie x>(a) denotes the upper a-point of the 
xe > distribution (in other words, if Z ~ Xp then 
P {Z >X> 2(a)} = a). But in the light of what we have 
seen in the estimation problem, it is natural to 
consider confidence sets that are spheres centred 
at 6! IS (or gis @» for some Oy € R°). Since the distri- 
bution of ||@/5 - 6||? depends on ||@||, we can no 
longer obtain an exact (1- «)-level confidence set, 
but it may be possible to construct much smaller 
confidence sets - using bootstrap methods to ob- 
tain the radius, for example - which still have at 
least (1-«)-level coverage (e.g. SAMWORTH, 2005). 


A baseball data example 


The following example is adapted from Sam- 
WORTH (2005). The data in Table 1 give the base- 
ball batting averages (number of hits divided by 
number of times at bat) of p = 9 baseball players, 
all of whom were active in 1990. The source was 
www.baseball-reference.com. Fori=1,..., p, let n; 
and Z; respectively denote the number of times at 
bat and batting average of the ith player during 


Player n; Z Ti; 

Baines 415 | 0.284 | 0.289 
Barfield 476 | 0.246 | 0.256 
Bell 583 | 0.254 | 0.265 
Biggio 555 | 0.276 | 0.287 
Bonds 519 | 0.301 | 0.297 
Bonilla 625 | 0.280 | 0.279 
Brett 544 | 0.329 | 0.305 
Brooks Jr. | 568 | 0.266 | 0.269 
Browne 513 | 0.267 | 0.271 


A Table 1: Table showing number of times at 
bat n,, batting average Z; in 1990, and career 
batting average 77;, of p = 9 baseball players. 


the 1990 season. Further, let 7; denote the player's 
true batting average, taken to be his career batting 
average. (Each player had at least 3000 at bats in 
his career.) We consider the model where Z,,..., 
Z, are independent, with Z; ~ n;' Bin(n;, 7). 


We make the transformation 
X; = /nj sin (2Z; - 1), 


and let 0; = \/ni sin”' (27; - 1), which means that 
X; is approximately distributed as N(6,, 1). A heu- 
ristic argument (which can be made rigorous) to 
justify this is that by a Taylor expansion applied 
to the function g(x) = \/ni sin ' (2x — 1), we have 


X; - 0; = g(Zi) - g(mi) » g'(m)(Zi-71 
_ Vmi(Zi— 7) 
Vmi(1-m) 


and this latter expression has an approximate 
N(0, 1) distribution when n; is large, by the cen- 
tral limit theorem. In fact, since min; n; > 400, 
an exact calculation gives that the variance of 
each X; is between 1 and 1.005 for 7; € [0.2, 0.8]. 
For our prior guess 0) = (81, ..-, 9o,)', we take 
6); = Vn sin”! (2m - 1), with 1 = 0.275 and 
n =p! D2, n;. We find that || X- 0||* = 2.56, some- 
what below its expected value of around 9, though 
since the variance of a yj random variable is 18, 
this observation is only around 1.5 standard de- 
viations away from its mean. On the other hand, 
\65,, - O° = 1.50, so Stein estimation does pro- 
vide an improvement in this case. 


Only number whose letters are in alphabetical order. Venus 


returns to the same point in the night sky every 40 years. 


Letting 7 = (m,...,7,) and recalling that@ isa References 
function of 7, the usual 95% confidence set for 7 
is 1. W.James, and C. Stein, Estimation with quad- 
{x € [0,1]? : |X — Ol? < 16.9}. ratic loss, Proc. Fourth Berkeley Symposium, 
1, 361-380, Univ. California Press (1961) 
R. Samworth, Small confidence sets for the 
mean of a spherically symmetric distribution, J. 
(X) - 9? < 12.5}. Roy. Statist. Soc., Ser. B, 67, 343-361 (2005) 
oe a ; 3. C. Stein, Inadmissibility of the usual estimator 
Numerical integration gives that the volume ratio of the mean of a multivariate normal distri- 
of the bootstrap confidence set to the usual con- bution, Proc. Third Berkeley Symposium, 1, 
fidence set in this case is 0.26, so the benefits of 197-206, Univ. California Press (1956) 
having centred the confidence set more appropri- 
ately are quite substantial. 


On the other hand, the 95% confidence set for 4 
constructed using the bootstrap approach is 


{we [0,1]? : |68 


+,0 


4. P Y.-S. Shao and W. E. Strawderman, Im- 
proving on the James-Stein positive-part esti- 
mator, Ann. Statist., 22, 1517-1538 (1994) 


Appendix 


First note that since |X - 0||* ~ x3, we have R(6°, @) =p for all 9¢ R?. To compute the risk of the James- 
Stein estimator, note that we can write 
(p-2)X |’ 
R(@S, 8) = | 
|x|? 


p : a 
=P-Ape-2) », {St cp ar (ime): 


Consider the expectation inside the sum when i= 1. We can simplify this expectation by writing it out 
as an n-fold integral, and computing the inner integral by parts: 


Xi(X,- ex | (xi - 91) yee 22 
| xP an fo lon Gare ee 


of |x|? = 2x0 Lo -tx-oy2/2 
Te 7 Ek x (omprn® dx,...dXxp, 


x-8@ 


since the integrated term vanishes. Repeating virtually the same calculation for components i = 2, ..., p, 


we obtain 
Es) SiS - lal? = 1 x—eyj?/2 
ea xP an > [-- fe ( a ‘loan eae 
> ( (XI? - 28) (2 ) 
= E = 2)E 
ye pe PE pe 


ROS, 8) = p-(p 2B J<p 


We therefore conclude that 


for all 6 € R°, as required. 


Number of Mozart's last symphony. The 
polynomial n2 + n+ 41 gives.primes for |n| < 40. 
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First published in issue 36, 1973 


ne shuffles a pack of cards by dividing 
QO it into two portions and merging them 

onto a flat surface by riffling with the 
fingertips. The problem that concerns us is: How 
many times do we have to do this before the pack 
is perfectly shuffled, and what exactly is the best 
method of shuffling? A pack of cards is said to be 
perfectly shuffled if 


(i) either all possible decks are equally likely 
(this is what we require for patience), 


(ii) or all possible deals of the deck into hands 
are equally likely (this is what we require for 
Bridge). 


The objective of this article is to find an explicit 
solution of the problem in case (i) and give some 
guidelines in case (ii) which may enable some in- 
terested readers to solve the problem in that case 
also. We will start by proving some subsidiary re- 
sults, after noting that any sequence of shuffles of 
a pack of N cards can be regarded as a random 
permutation of the integers from 1 to N. 


Lemma 1 If D(i) fori = 1, ..., kis the set of 
decks which can arise from shuffling a given 
pack once, and if there exists x; > 0 such that 
>i. x; = 1, then there exists a shuffling strategy 
under which P(D(i)) = x,;. 


Proof: Define P; ;(a) = P(a,L| «), where a is a 
sequence of the form (L, R, R, L, R, L, ...) telling 
us whether each of the first j - 1 cards fell from the 
right or the left. Then the shuffling strategy is de- 


Answer to the ultimate question of life, the universe, 
and everything. Magic number in a 3x3x3 magic cube. 


fined by the P; ; (a). Set S(a) be the subset of D(i) 
which corresponds to shuffles beginning with a 
(the reader should satisfy himself that the obvious 
correspondence between shuffles and decks is a 
bijection). Then we set 
Bite Liev(i)e(at) % 
Did (i)es(a) Xi 

A simple inductive check shows that this is the re- 
quired strategy. 


Lemma 2 There is a bijection between decks 
obtained by shuffling a new pack of N cards m 
times or less, and sequences a; for i=1,...,N 
satisfying 
(i) 1<a,;<N; 
(ii) a;# qj whenever i # j; 
(iii) a; can be expressed as the union of p 
subsequences Df, all of which satisfy 
bk — bk, = 1, where p < 2”. 


Proof: Label each card of the new deck with an 
integer from 1 to N, starting from the top and 
working down. Then any deck after m shuffles can 
be represented as a sequence satisfying (i) and (ii). 
Furthermore, the 2” subsets of the deck, such that 
two elements of the same set were in the same 
portion of the pack after every cut (some sets of 
which may be empty), correspond to the subse- 
quences b‘., for a shuffle cannot change the order 
of such a subset. 


Conversely, suppose we are given the subsequence 
bi. We may assume without loss of generality that 


each sequence bj, is the maximal subsequence 
satisfying (iii) and containing b},, otherwise one 
would concatenate subsequences until this were 
true. 


Define a vector v(k) with [log,p] components, 
such that v,(k) is the coefficient of 2‘ in the bi- 
nary expansion of k - 1. Assume also without loss 
of generality that bj, is the least number not equal 
to one of U bj, by ordering the subsequences 
in a canonical manner. 


Then corresponding to this sequence a; we define 
a sequence [log ,p] shuffles. The sequences bf, are 
all in the top half at the j’th cut if vy _;(k) > 0 
and are all in the bottom half otherwise. A sim- 
ple inductive argument (left to the reader) shows 
that such a sequence of shuffles exists and is 
unique. Furthermore, since p is not greater than 
2”, [log .p] it is not greater than m. 


We now proceed to define a sequence of optimal 
shuffling procedures P,, P3, ..., P, such that, if 
{D,,(i), i= 1, ..., k,} is the set of all decks which 
can be obtained by shuffling a new deck n times or 
less, then all the D,,(i) are equally likely after the 
successive application of P,, ..., P,. We define P,, 
inductively: let P,, ..., P,_. be perfect shuffles and 
let M,,(j) be the number of elements of D,,_; (i) for 
i=1,...,k,.;, which can give rise to D,(i) after a 
single cut and shuffle. Let {D,,(kcj), i= 1, ... Lac} 
be the set of all decks obtainable by a sequence 
of n shuffles in which C was the last cut. The we 
define the probability that the cut in P,, is C to be 


Luc 1 
> M,(j) 


By applying Lemma 1 we can construct a shuffling 
strategy such that 


1 


P(cut SC; deck = D,lke,)) = k, M,(ke,;) 
n n Cj 


Summing over all possible cuts we get P(deck = 
D,(kc,)) = 1/k,. Thus we have defined P,,. The 
reader might care, as an instructive exercise, to 
work out P,, the optimal procedure for the first 
shuffle. 


It is now easy to see that the smallest number of 
shuffles necessary to randomise a pack of N cards 
completely is [log ,N]: we consider the deck in 
which the original order of the pack is reversed 
and apply Lemma 2 to see that D,,(i), case P, P, 
... P,, will perfectly shuffle the pack. This is a nice 
result, and what one would expect from informa- 
tion theory. 


The corresponding result for case (ii) mentioned 
in the introductory paragraph would be that one 
requires [log , M] shuffles, where M is the number 
of players. Unfortunately the beauty of this result 
is spoiled by the fact that it is false — the true value 
is [log , f(M,N)], where f(M,N) is the least f such 
that for any M hands of N/M cards Hj for j = 1, 
.... N/M andi =1, ..., M, there exists a sequence 
a, of N integers as defined in Lemma 2, with the 
number of subsequences Dé less than or equal to 
(MLN) and also 


N/M | N-1 
U Hj ~ U Gj+jiu 
j=l j=0 


Unfortunately it is not clear how f(M,N) can be 
evaluated; all that is clear is that M < f(M,N) < 
N. Perhaps one of our readers would care to earn 
himself a place in the hearts of Bridge players eve- 
rywhere by solving this problem? 


Smallest prime number expressible as the 
sum of two, three, fouror five primes. 
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First published in issue 16, 1953 


ost mathematicians know the theory of 
Me game of Nim, described in books 

on mathematical recreations. But few 
seems to be aware of Dr P. M. Grundy’s remark- 
able generalisation, published in Eureka 2 in 1939. 
Consider a game I in which 2 players move alter- 
nately, and the last player wins (moving to a “ter- 
minal position’). Define inductively a function 
G(P) of the position P as follows: 


(a) if P is terminal, G(P) = 0; 


(b) if there are permitted moves from P to Q, 
from P to R, from P to S, and so on, then 
G(P) is the least non-negative integer differ- 
ent from all of G(Q), G(R), G(S), ... 


It follows that if 0 < r < G(P) there is a move from 
P to some R with G(R) = 1, but no move to any 
position U with G(U) = G(P). If positions P with 
G(P) = 0 are called “safe, the winning strategy is 
to move always to a safe position: either this is 
terminal, and wins immediately, or the opponent 
moves to an unsafe position and the cycle repeats. 


Now imagine the players engaging in a “simulta- 
neous display” of k games T,, 1, ..., I, of this sort, 
the rule being that each player in turn makes a 
move in one and only one game, or if he cannot 
move in any game he loses. Let P;, P3, ..., P, be 
the positions in the respective games T',,T, ..., Ty. 
Then Grundy’s Theorem states that 


(i) this combined position is safe if and only if 
k heaps of G(P,), G(P), .... G(P,) counters 
respectively form a safe combination in Nim, 


Number of derangements of 5 items. 
Tribonacci, happy and octahedral number. 
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(ii) more generally, the G function of the 
combined position is the “nim-sum” of the 
separate G(P,), i.e. obtained by writing the 
G(P,) in the scale of 2 and adding columns 
mod 2. 


For no player can gain any advantage by mov- 
ing so as to increase any G(P,), as the opponent 
can restore the status quo. And if only decreases 
in G(P,) are considered, the game is identical 
with Nim, thus proving assertion (i). Therefore 
G(P) = g if and only if the combined position 
(P, P’) is safe, where G(P’) = g. From that (ii) fol- 
lows fairly readily. 


It follows that we can analyse any such combined 
game completely, provided that we can find the 
G(P,) for the component positions. Nim is an 
example; a heap H,, of x counters constitutes a 
component position, since each player in turn 
alters one heap only, and G(H,.) = x. Many vari- 
ants of Nim are similarly analysed. Less trivial is 
Grundy’s game, in which any one heap is divided 
into two unequal (non-empty) parts. Thus heaps 
of 1, 2, are terminal, with G = 0, a heap of 3 can 
only be divided into 2 + 1, which is terminal, so 
G(H;) = 1. Generally G(H,.) in Grundy’s game is 
the least integer > 0 different from all nim-sums 
of G(H,) and G(H,_,) for 0 < +x. The series goes 
012345 67 8 
000102102 


x 9 
G(H,) = 1 
continuing with 0, 2, 13, 2, 1, 3, 2, 4, 3, 0, 4, 3, 0, 4, 
3, 0, 4, 1, 2, 3, 1, 2, 4, 1, 2, 4, 1, 2, ... This curious 


“somewhat periodic series” seems to be trying to 


HN 
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have period 3, but with jumps continually occur- 
ring. Richard Guy confirmed this up to x = 300. 
He suggested that it might be played on a piano, 


taking 0 to be middle C, |= D, 2 = E, etc. The inner 
meaning then became evident: 


GI \ 

K 

=e i © ow og 
eee oe . oe oe oe “2 oe > Ss 


e 


> 
Di viding helpsin different parts, different parts. Not the easiest of the arts: No, no, no! (Just you try.) No, no, no! 


Guy also worked with rows R,, of x counters, in 
which certain sets of consecutive counters could 
be extracted (thus possibly leaving two shorter 
rows, one each side of the extracted set). In his “+6” 


game, any one counter can be removed, except an 
R, (= a single counter standing on its own). The 
G(R,,) series (x = 1, 2, ...) is a waltz. (Note that 
some notes span two bars.) 


i 5 = = : s x 
goo oe 7 o a oe zo Se e 5 
If I’m al one, all on my own, there] must al ways _ stay. But if I touch a no ther such, 
4 oe = = oe oe | 
oe ole : oe oe e z oe o - : 
co) Ve # o oe 
I maybe taken a way. Andas a boon, this lit tle tune shows you the right move to _ play. 


But at this point the tune completely broke down. 
I asked Guy if he could think of any reason for 


that. He said, “Yes, an error I made in the calcula- 
tion” After correction the waltz proceeds: 


os > ‘ 
= oS, «1 = | eo ie 


weg te +e ew 1 oe ee oe | 


This series still quite baffles me. The gener- term 


This tries to be periodic with period 26, but jumps 
keep appearing. Many other such games give 
tuneful, somewhat periodic series, for no evident 
reason. Guy discovered two curious exceptions: 
his “4”, remove 1 counter not at the end of a row, 
has exact period 34 for x > 54, and Kayles, remove 


a 
I cannot see. P’haps it just wanders a long aim less ly. 


lor 2 adjacent counters, has exact period 12 for x > 
71. Thus these games have a complete analysis. But 
generally it might be helpful to bring in a profes- 
sional musician to study number theory. Perhaps 
a thorough study of the Riemann Hypothesis will 
uncover the Lost Chord. After all, why not? 


Kaprekar number, since 457 = 2025 and 20+ 25 = 45. 
Triangular, hexagonal and 16-gonal number. 
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The Logic of Logic 


Zoe Wyatt, Newnham 


barked on a quest to find a secure foundation 
for their subject, based on the use of axioms 
and rigorous logic. 


I n the early 20th century mathematicians em- 


By itself however, a collection of axioms is not 
very useful, since it does not generate anything on 
its own. Only in conjunction with some logic, the 
rule of inference for example, do axioms lead to 
results. 


The rule of inference, or modus ponens, says: 


P=>Q,P 
a. 


In this and any mathematical expression, we use 
symbols to express ourselves. As with words in 
language and expression in conversation, we rely 
on ‘tools’ to convey the substance of our thoughts. 
Of course with words we can find inconsistencies: 


I fit into my shirt. 
My shirt fits into my bag. 
‘Therefore I fit into my bag. 


Though trivial, this shows that words have an un- 
derlying associated meaning. With this restriction 
in mind, we could fix the above by replacing ‘bag’ 
with ‘very large wardrobe’ 


Mathematics avoids such a restriction altogether; 
swapping P with Q in the modus ponens would 
still yield the same results. Obvious you might 
think, but philosophically this structural differ- 
ence is of great importance. 


Number of human Chromosomes. 
Centred triangular number. Erdés-Woods number. 


Hilbert’s Finitism 


Start with the statement: V x € Z, $(x) is true, 
where ¢(x) can be precisely one of ‘true’ or ‘false’ 
If we negate this statement, would you imagine 
checking an infinite number of x’s for falsity in ¢? 
Or perhaps spot a suspicious looking x and prove 
him to be a counterexample? 


In the early 1920s, Hilbert was losing sleep on such 
matters. Or to be precise, he was concerned with 
making meaningful propositions and methods of 
reasoning which did not require the acceptance 
of infinite entities. This finitary viewpoint is par- 
ticularly important in the context of mathemati- 
cal operations, by only allowing arguments which 
can be translated into a finite set of propositions 
starting from a finite set of axioms. 


Of particular concern was the Quantifier Law of 
Excluded Middle (QLEM): 


Every x satisfies , or some x 
satisfies the negation of ¢, 


where ¢ is again a statement which is either true 
or false. 


Hilbert held a finitary view, meaning that if the 
domain being tested was infinite, the QLEM was 
not to be trusted. How could he know the value of 
@ (x) for any one of an infinite of x’s? More gener- 
ally, the finite belief prevented simultaneously al- 
lowing a property to be associated with infinitely 
many objects. In our case, it means we cannot ap- 
ply an infinite conjunction to the integers: 


$(1) and ¢(2) and ¢(3) ... 


If instead we are equipped with only finite pro- 
cedures, then given a particular integer, we are 
able to prove the instance of é(n) for precisely that 
case. Hence negating this statement is not univer- 
sal. So not $(n) indicates that every instance of ‘n 
fails property ¢ is true, but it does not tell us that 
not every instance of $() is true. Put more simply, 
the following statements are not universally logi- 
cally equivalent: 


« Not every x satisfies (x); 


« Some x satisfies not-¢(x). 


Thus QLEM fails. More generally though, this 
shows that problems in our base assumptions 
need to be addressed to prevent ramifications fur- 
ther on. 


Truth and its Limitations 


Also during the late 19th and 20th Centuries, 
many mathematicians began to question the lim- 
its on what kind of mathematical objects could 
be represented and manipulated. Aside from the 
popular Gédel’s Theorem of Incompleteness, and 
Russell's Paradox, a key if slightly less popular re- 
sult is Tarski’s Undefinability Theorem. Published 
in 1936, this (very informally) states: 


Given a formal arithmetic, a true arithmetical 
statement cannot be defined in that arithmetic. 


To explain the original, technical form of this the- 
orem would take too long here [2], however we 
see from the above statement of the theorem that 
formal languages containing semantic terms like 
“true’, will always give a paradox when these terms 
are self-referenced. Tarski addressed this by mak- 
ing the distinction between semantically closed 
and semantically open languages. He defined a 
semantically closed language to be one in which 
it is possible for a single sentence to predicate (de- 
termine) truth or falsehood in another sentence 
in the same language, or even of itself. Put simply, 
a semantically closed language can apply seman- 
tic properties to the terms that express semantic 
properties. 


This suggests that for a semantically open lan- 
guage to achieve consistency, we need to use a 
more powerful language, called a metalanguage, 
in order to be able to define a truth predicate. One 
of the most common uses of metalanguages is in 
computer science, such as the Backus-Naur Form 
developed in the 1960s, to describe the syntax of 
computer programming languages. 


David Hilbert (1862 - 1943) 


So Have We Done Anything? 


Understanding what makes a statement true or 
false, and how our mathematics relates to our 
thoughts, has many times uncovered the limita- 
tions of underlying assumptions. Such question- 
ing often leads to fruitful ways of new thinking, a 
key example being the development of hyperbolic 
geometry in the 19th Century by the rejection of 
Euclid’s 2000 year old parallel line axiom. 


Similarly the work of Hilbert, Tarski and their 
contemporaries’ had large ramifications in not 
only mathematics, but also in philosophy and se- 
mantics. Indeed the year after Hilbert published 
his foundation of classical mathematics, the phi- 
losopher Wittgenstein wrote extensively on the 
limitations of language, making the famous com- 
ment: 


‘The limits of my language, mean 
the limits of my world. 


If mathematics is our language, where do you 
think we are limited? 


References 


1. P. Davis, R. Hersh, The Mathematical Experi- 
ence, Pelican Books, New York (1981) 

2. M.Giaquinto, The Search for Certainty - A 
Philosophical Account of Foundations of 
Mathematics, OUP (2006) 

3. J. Hintikka, The Philosophy of Maths. Lan- 
guages in which Self Reference is possible, R. 
M. Smullyan, OUP (1969) 


Not a palindrome in any base 6 for2 <6 < 45. 
Atomic number of silver. Carol number. 


47 


Alan Turing Year 
All 2012 


100 years since this great 
British mathematician and 
computer scientist was born. 
Known for his contributions to 
computability theory, crack- 
ing the as Enigma in World 
War Il and the Turing machine, 
he died at the age of 41. 


Crafoord Prize 
January 2012 


Awarded to the Fields Medallists 
Terence Tao and Jean Bourgain. 
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All 2012 
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Shaw + Physics Prizes 
May and July 201 


Russian Fields Medal- 
list Maxim Kontsevich 
received the 2012 
Shaw Prize in Math- 
ematical Sciences. He 
was also one of the 

9 recipients of the 
Fundamental Physics 
Prize awarded for the 
first time this year. 


e Abel Prize 


Awarded, by the King of 
Norway, to the Hungarian- 
American mathematician 
Endre Szmeredi “for his 
fundamental contributions 
to discrete mathematics and 
theoretical computer science, 
and in recognition of the 
profound and lasting impact 
of these contributions on 
additive number theory and 
ergodic theory.” 


| Discovery of the Higgs Boson 


| 4 July 2012 


a 


CERN announced findings of a new par- 
ticle that was consistent with the pre- 

dicted Higgs boson. Evidence for this 

so-called ‘God Particle’ has been sought 
after for many decades, and if this really 
was a Higgs, its existence would explain 
many mysteries of the universe, includ- 
ing how matter attains mass. 


ABC Conjecture 


August 2012 


Japanese mathematician Shinichi Mo- 
chizuki has published a proof online. It 
is still under verification. 


Let a, b, c be relatively prime integers with 
a+b=c. Then forany€ > 0, there is some 
C, such that max(|a], |b}, |c|) < Ce TT pjabe 
p'** when p is prime. 

Far reaching consequences include 
Roth's Theorem, Fermat's Last Theorem 
and the Mordell conjecture. 


21/12/2012 


According to widely ac- 
cepted arithmetic and astro- 
nomical theories, the world 
will ... possibly ... probably 
... Not end. 


Junk DNA 


September 2012 


OK - not mathematics but a 
great discovery. While over 
98 % of the human genome 
had been previously thought 
to serve no purpose, and so 
called junk DNA, the ENCODE 
project released 30 papers 
disproving this to reveal that 
over 80% perform vital func- 
tions in the body. 


Type 1/49 in your calculator Xs) 
and you will be very surprised... 
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Smallest number that can be written as the 
sum of of two squares in two distinct ways. 


Hopes 
and Fears 


Paul Dirac 


First published in issue 32, 1969 


research worker who is actively follow- 

ing up some idea referring to the funda- 

mental problems of physics has, of course, 
great hopes that his idea will lead to an important 
discovery. But he also has great fears — fears that 
something will turn up that will knock his idea on 
the head and set him back to the starting point in 
his search for a direction of advance. Hopes are 
always accompanied by fears, and in scientific 
research the fears are liable to become dominant. 


As a result of these emotions the research worker 
does not proceed with the detached and logical 
mind that one would expect from someone with 
scientific training, but is subject to various re- 
straints and inhibitions which obstruct his path 
to success. He may delay taking some step liable 
to force a rapid show-down, and may prefer first 
to nibble at side-issues that provide a chance of 
achieving some minor successes and gaining a lit- 
tle strength before facing the crisis. 


For these reasons the innovator of a new idea is 
not always the best person to develop it. Some 
other person without the fears of the innovator 
can apply bolder methods and may make a more 
rapid advance. In the following there will be some 
examples that illustrate this situation. 


Anyone who has studies special relativity must 
have wondered why it was that Lorentz, after he 
had obtained correctly all the equations of the 
Lorentz transformation, did not then take the 
perfectly natural step of considering all frames 
of reference to be on the same footing and so ar- 


riving at the relativity of space and time. History 
does not record just what it was that held Lorentz 
back, but it can only have been some kind of fear, 
perhaps a subconscious one. He did not dare to 
venture out into a domain of thought completely 
foreign to anything that anyone had ever imag- 
ined. He preferred to remain on the solid round 
of mathematical transformations, where his posi- 
tion was unassailable. It needed the boldness of a 
younger man such as Einstein to take the plunge 
into a new domain. 


The innovator of our present quantum mechanics 
was Heisenberg. At a time when atomic physicists 
were floundering about with the orbits of Bohr- 
Sommerfeld theory and feeling the need for a 
drastic alteration of basic principles, Heisenberg 
has the brilliant idea of constructing a new theory 
entirely in terms of observable quantities, quan- 
tities connected with observations on spectra. 
These are each connected with two atomic states, 
and the natural way of expressing them is in the 
form of matrices. Thus Heisenberg was led to con- 
sider matrices as dynamic variables. 


He had not proceeded far in developing this 
idea before he noticed that his dynamical vari- 
ables would not satisfy the commutative law of 
multiplication. This was most disturbing. It was 
inconceivable to a physicist in those days that 
dynamical variables could be any other than or- 
dinary algebraic quantities, and with the appear- 
ance of non-commutation Heisenberg had grave 
fears that his whole beautiful idea would have to 
be given up. 


When I read Heisenberg’s first paper on the sub- 
ject, I had the advantage over him in not having 
his fears, as it was not my own idea that was at 
stake. I was therefore able to look at the question 
from a more detached point of view. 


I needed only a week or two to realize that the 
non-commutation which alarmed Heisenberg 
was really the dominating feature of the new 
theory. The idea of building up a theory entirely 
in terms of experimentally observed quantities, 
although a very pleasing philosophical doctrine, 
was of only secondary importance for the pur- 
pose of establishing a new dynamics. 


My early work on quantum mechanics was thus 
concentrated on the problem of bringing non- 
commutation into dynamical theory. It was not 
really very difficult, because the previous atomic 
theory, the orbit theory of Bohr and Sommerfeld, 


was based on a form of dynamics, Hamilton's 
form, which turned out to be specially suitable for 
adapting to non-commutative algebra. 


Heisenberg continued to develop his theory, in 
collaboration with other people in Géttingen. I 
worked independently from them, apart from 
getting the initial idea from Heisenberg. We pub- 
lished papers at about the same time, setting the 
foundations for quantum mechanics. Our styles 
were different on account of the different points 
of view we held, mine being based on non-com- 
mutation and Heisenberg’s on the use of matrices 
built up from observable quantities. 


Quantum mechanics was discovered quite inde- 
pendently by Schrodinger, working on entirely 
different lines. He had his own difficulties. He 
was thinking over the mathematical connection 
between waves and particles that had been dis- 
covered some time previously by de Broglie, and 
eventually found a way of generalizing it to apply 
to an electron moving in an electromagnetic field. 
He then had a very beautiful wave equation, con- 
forming to relativity. He proceeded to apply it to 
the hydrogen atom and his worst fears were real- 
ized. The results did not agree with observation. 


We know now that the discrepancy was due to 
the spin of the electron, which was unknown to 
Schrédinger at the time, although the experimen- 
talists had begun to suspect it. It was a most de- 
pressing situation for Schrédinger, and led him to 
abandon the work for some months, and eventu- 
ally to publish it only in the non-relativistic ap- 
proximation, in which the discrepancy does not 
show up. The relativistic equation was later re- 
discovered by Klein and Gordon, who were not 
afraid to publish an equation in disagreement 
with observation, while Schrodinger was. So the 
equation now bears their name. It has some value 
in describing spinless mesons. 


Schrédinger’s quantum mechanics was soon 
found to be equivalent to that originated by 
Heisenberg, in spite of their first seeming so dif- 
ferent. The basic equations of the new mechanics 
were securely established, and it became neces- 
sary to find a physical interpretation for them. 
With non-commutative algebra it could not be as 
direct as in the classical theory. The general physi- 
cal interpretation was found to be only a statistical 
one. One could calculate probabilities, but could 
not usually predict an event with certainty. 


One of few to be both a pentagonal 
number and a centred pentagonal number. 
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A difficulty now appeared in connection with the 
relativistic equation of Klein and Gordon. The 
theory sometimes gave negative probabilities. It 
was a satisfactory theory only when it was used 
non-relativistically. I puzzled over this for some 
time and eventually thought of a new wave equa- 
tion which avoided the negative probabilities. I 
found that it also gave automatically the spin of 
the electron, a most gratifying result. I proceeded 
to apply the new equation to the hydrogen atom, 
taking into account the relativistic corrections 
only to the first order of accuracy to simplify the 
calculations, and found agreement with observa- 
tion. 


The natural thing to do at this stage would have 
been to continue to higher orders of accuracy, but 
I did not do so. I was scared that they might not 
agree with observation. I hastily wrote up a paper 
with merely the first order of accuracy and pub- 
lished that. In doing so I felt I was consolidating a 
limited success, and even if the higher orders did 
go wrong there would still be something to stand 
on. It was left to Darwin, who did not share my 
fears to carry out the calculation to all orders of 
accuracy and see that the results were alright. 


In my first paper on the subject (Proc. Roy. Soc A 
117, page 610) there occurs the equation 


2 
< Ao} t 
c 


The relativist, if he sees this equation nowadays, 
will say at once: there is a mistake here. The plus 
signs before the second and third terms on the 
right should be minus’. He will wonder how such 
a conspicuous mistake could have remained un- 
detected in the proof-reading. He will wonder still 
more when he sees the same mistake perpetuated 
in later equations. 


The explanation is that there is really no mistake 
and things were published as the author intended. 
The plus signs were the expression of a fear. At 
that time relativity was still unfamiliar and peo- 
ple had continually to cling to the symmetry of 
space and time so as not to let it out of their heads. 
The symmetry becomes perfect only if one uses a 
time variable which is V—1 times the usual time 
and makes a corresponding change in all 4-vec- 


Number of paying cards in one deck 
and number of white keys on a piano. 


tors. With this notation there are no mistakes in 
the paper. This notation was frequently used in 
those days, and it was not considered necessary 
to explain it every time it was used, because the 
context made it clear. The arrival of the new wave 
equation rather forced one to give it up, as it then 
became too clumsy. 


The new wave equation led to a difficulty in that 
it allowed states of negative energy for the elec- 
tron. Negative energies are never observed, but 
they could not be ignored in the theory. I thought 
of a way of coping with them, namely, to assume 
that in the physical world all or nearly all of the 
negative-energy states are occupied, so that ordi- 
nary positive-energy electrons cannot jump into 
them. An unoccupied negative-energy state is a 
hole which appears as a particle with a positive 
energy and a positive charge. 


Right from the beginning I had the feeling that 
there would be symmetry between the holes and 
the electrons. This feeling was strengthened by the 
knowledge that in the chemical theory of the va- 
lency of atoms, there is a considerable amount of 
symmetry between an electron lying outside the 
closed shells and a hole in a closed shell. I did not 
want the symmetry. At that time it was believed 
that all positive charges were in protons, and 
the proton was much heavier than the electron. 
So I struggled with the hope that in some way 
the Coulomb interaction between the electrons 
would lead to a dissymmetry between the holes 
and the electrons, and was afraid that if this hope 
should fail the whole idea would have to be aban- 
doned. It was left to others, in particular Weyl and 
Oppenheimer, to make the bold assertion that 
mathematical symmetry demanded that the holes 
should have the same mass as the electrons. 


With these developments the theory of single par- 
ticles was put into order. There remained prob- 
lems concerned with interaction. If one sets up 
precise relativistic equations one finds that the 
interaction is so violent that the equations do not 
have any solutions. The difficulties are still not 
satisfactorily resolved and point to the need for 
some further drastic change in the foundations of 
atomic theory. 


Smallest prime number that does not 
divide the order of any sporadic group. 
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Quantum Gravity 


Stephen Hawking, DAMTP Cambridge 


First published in issue 32, 1969 


he interactions that one observes in the 

physical universe are normally divided 

into four categories according to their bo- 
tanical characteristics. In order of strength they 
are, the strong nuclear forces, electromagnetism, 
the weak nuclear forces and, the weakest by far, 
gravity. The strong and weak forces act only over 
distances of the order of 107 cm or less and so 
they were not discovered until this Century when 
people started to probe the structure of the nu- 
cleus. On the other hand electromagnetism and 
gravity are long range forces and can be readily 
observed. They can be formulated as classical, ice. 
non quantum, theories. Gravity was first with the 
Newtonian theory followed by Maxwell's equa- 
tions for electromagnetism in the 19th Century. 
However the two theories turned to be incom- 
patible because Newtonian gravity was invariant 
under the Galilean group of transformations of 
inertial frames whereas Maxwell equations were 
invariant under the Lorentz group. The famous 
experiment of Michelson and Morley, which 
failed to detect any motion of the Earth through 
the luminiferous aether that would have been re- 
quired to maintain Galilean invariance, showed 
that physics was indeed invariant under the Lor- 
entz group, at least, locally. It was therefore neces- 
sary to formulate a theory of gravity which had 
such an invariance. This was achieved by Einstein 
in 1915 with the General Theory of Relativity. 


General Relativity has been very successful both 
in terms of accurate verification in the solar sys- 
tem and in predicting new phenomena such as 


black holes and the microwave background radia- 


tion. However, like classical electrodynamics, it 
has predicted its own downfall. The trouble arises 
because gravity is always attractive and because it 
is universal ie. it affects everything including light. 
One can therefore have a situation in which there 
is such a concentration of matter or energy in a 
certain region of space-time that the gravitational 
field is so strong that light cannot escape but is 
dragged back. According to relativity, nothing can 
travel faster than light, so if light is dragged back, 
all the matter must be confined to a region which 
is steadily shrinking with time. After a finite time 
a singularity of infinite density will occur. 


General Relativity predicts that there should be a 
singularity in the past about 10,000 million years 


ago. This is taken to be the “Big-Bang”, the begin- 


ning of the expansion of the Universe. The theory 


also predicts singularities in the gravitational col- 


lapse of stars and galactic nuclei to form black 
holes. At a singularity General Relativity would 
lose its predictive power: there are no equations to 


govern what goes into or comes out of a singular- 
ity. However when a theory predicts that a physi- 


cal quantity should become infinite, it is generally 
an indication that the theory has broken down 
and has ceased to provide an accurate description 


of nature. A similar problem arose at the begin- 


ning of the Century with the model of the atom as 
a number of negatively charged electrons orbiting 
around a positively charged nucleus. According 
to classical electrodynamics, the electrons would 
emit electromagnetic radiation and would lose 
energy and spiral into the nucleus, producing a 
collapse of the atom. The difficulty was overcome 


Sum of the first 10 integers and 10th Fibonacci number. 
Largest number that is both triangular and Fibonacci. 
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by treating the electromagnetic field and the mo- 
tion of the electron quantum mechanically. One 
might therefore hope that quantisation of the 
gravitational field would resolve the problem of 
gravitational collapse. Such a quantisation seems 
necessary anyway for consistency because all oth- 
er physical fields appear to be quantised. 


So far we have had only partial success in this 
endeavour but there are some interesting results. 
One of these concerns black holes. According to 
the Classical Theory the singularity that is pre- 
dicted in the gravitational collapse will occur in 
a region of space-time, called a black hole, from 
which no light or anything else can escape to 
the outside world. The boundary of a black hole 
is called the event horizon and acts as a sort of 
one way membrane, letting things fall into the 
black hole but preventing anything from escap- 
ing. However, when quantum mechanics is taken 
into the account, it turns out that radiation can 
“tunnel” through the event horizon and escape 
to infinity at a steady rate. The emitted radia- 
tion has a thermal spectrum with a temperature 
inversely proportional to the mass of the black 
hole. As the black hole emits radiation, it will loss 
mass. This will make it get hotter and emit more 
rapidly. Eventually it seems likely that the black 
hole will disappear completely in a tremendous 
final explosion. However the time scale for this to 
happen is much longer than the present age of the 
Universe, at least for black holes of stellar mass, 
though there might also be a population of much 
smaller primordial black holes which might have 
been formed by the collapse of irregularities in 
the early Universe. 


One might expect that vacuum fluctuations of 
the gravitational field would cause “virtual” black 
holes to appear and disappear. Particles, such as 
baryons, might fall into these holes and be radi- 


Sum of the first six triangular numbers and the first 
six odd primes. Tetranacci and Pronic number. 


ated as other species of particles. This would give 
the proton a finite lifetime. However it is difficult 
to discuss such processes because the standard 
perturbation techniques, which have been suc- 
cessful in quantum electrodynamics and Yang- 
Mills theory do not work for gravity. In the former 
theories one expands the amplitudes in a power 
series in the coupling constant. The terms in the 
power series are represented by Feynmann dia- 
grams. In general these diverge but in these theo- 
ries all the infinities can be absorbed in a redefi- 
nition or “renormalisation” of a finite number of 
parameters such as coupling constants as masses. 
However in the case of gravity, the infinities of dif- 
ferent diagrams are different and so they would 
require an infinite number of renormalisation 
parameters whose values could not be predicted 
by the theory. In fact the situation is not really 
that much worse than with the so-called renor- 
malisable theories since even with them the per- 
turbation series is only asymptotic and does not 
converge, leaving the possibility of adding an ar- 
bitrary number of exponentially vanishing terms 
with undetermined coefficients. 


The problem seems to arise from an uncritical 
application of perturbation theory. In classical 
general relativity it has been found that pertur- 
bation expansions around solutions of the field 
equations have only a very limited range of valid- 
ity. One cannot represent a black hole as a per- 
turbation of flat space-time yet this is what sum- 
ming Feynmann diagrams attempts to do. What 
one needs is some approximation technique that 
will take into account the fact that the gravita- 
tional field and the space-time manifold can have 
many different structures and topologies. Such a 
technique has not yet been developed but we, at 
Cambridge, have been approaching the problem 
by studying the path integral approach formula- 


tion of quantum gravity. In this the amplitudes 
are represented by an integral over all metrics 


J Plslexp(-itg) 


where D[g] is some measure on the space of all 
metrics g and I[g] is the action of the metric g. 


If the integral is taken over real physical metrics 
(that is, metrics of Lorentzian signature - + + +), 
the action I is real so the integral oscillates and 
does not converge. To improve the eigenvalues 
one does a rotation of 90° in the complex t-plane. 
This makes the metric positive definite (signature 
+++ + and the action J pure imaginary so that the 
integral is of the form 


f Plslexp(-tls1) 


where f = —iI. The Euclidean action f has certain 
positive definite properties. 


One is thus led to the study of positive definite 
metrics (particularly solutions of the Einstein 
equations) on four-dimensional manifolds. If the 
manifolds are simply connected, their topology 
can be classified (at least up to homotopy) by two 
invariants, the Euler number as measuring the 
number of holes or gravitational instantons and 
the signature measures the difference between 
right-handed instantons and left-handed ones. 
It seems that the dominant contribution to the 
path integral comes from metrics with about one 
instanton per Planck volume 107? cm?s. Thus 
space-time seems to be very highly curved and 
complicated on the scale of the Planck length 
10 cm, even though it seems nearly flat on larg- 
er scales. 


However we still do not have a proper scheme for 
evaluating the path integral. The difficulty lies in 
defining a measure D[g] on the space of all met- 


rics. In order to obtain a finite answer it seems 
necessary to make infinite subtractions and these 
leave finite undetermined remainders. There is a 
possible way of overcoming this difficulty which 
may come from an extension of General Relativ- 
ity called supergravity. In this the spin 2 graviton 
is related to a spin 3/2 field and possibly fields 
of lower spin by anticommuting “supersymme- 
try” transformations. In these theories there is an 
equal number of bosons (integer spin particles) 
and fermions (half integer spin particles). The 
infinities that arise in the path integral from the 
integration over boson fields seem to cancel when 
the infinities that arise from the integration over 
the fermion fields, raising the hope that one could 
provide a proper mathematical definition of the 
path integral, maybe some limiting process. 


Supergravity theories have another very desirable 
feature, they may unify gravity with the other in- 
teractions and particles in physics. In 1967 Salem 
and Weinberg proposed a unified theory of the 
electromagnetic and weak interactions. This has 
had considerable success in predicting experi- 
mental results though the final confirmation will 
have to wait for the next generation of particle ac- 
celerators. Nevertheless, it has given great stimu- 
lus to attempts to unify the strong, the weak and 
the electromagnetic interactions into a “Grand 
Unified Theory”. A feature of such theories is that 
the complete unification is seen only at the very 
high energies of the order of 10!° Gev, at which 
quantum gravitational effects should become im- 
portant. It may well be therefore that one will be 
able to achieve the unification only by incorporat- 
ing gravity as well in a completely unified theory 
which would describe all of physics. This was 
the goal to which Einstein devoted the last thirty 
years of his life, without much success. The pros- 
pects look brighter now though it is still probably 
quite a long way off. 


The 57-cell is a self-dual abstract regular four- 
dimensional polytope. It has symmetry group L,(19). 
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r | “A here has been a recent flurry of articles and 
books proposing that we live in a multiverse 
rather than a universe: there is not one uni- 

verse but many (Rees 1999, Rees 2000, Carr 2008, 
Greene 2011). This raises key issues about the 
validity and utility of mathematical models, and 
their relation to what exists. Mathematical mod- 
els must be coherent as models, based in what- 
ever the underlying physics is; but that is not by 
itself sufficient to make them physically relevant. 
If they are meaningful as mathematical models of 
some physical system, they must be applicable to 
that context; which means you need to be able to 
test them and see if they describe the system well. 
If there is no possible way to test them, you have 
a problem: it is unclear whether they are indeed 
reliable models of reality. And that is a major issue 
that arises as regards multiverse theories . 


A variety of kinds of multiverses have been en- 
visaged by many authors. In his recent book The 
Hidden Reality (Greene 2011), Brian Greene pro- 
poses nine different types of multiverse theories: 


1. Existence beyond the horizon: Invisible 
parts of our own universe. 


2. Chaotic inflation, leading to different ex- 
panding domains in separate places. 


3. Brane worlds of M-theory (Four-dimension- 
al space-times embedded in higher dimen- 
sional spacetimes). 


4. Cyclic universes, leading to different ex- 
panding domains at different times. 


5. The Landscape of string theory embedded in 
a chaotic cosmology. 


Number of stellations of an icosahedron. 
Number of orthorhombic space groups. 
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A Figure 1 Space Time Diagram — Normal Distance and Time 


6. The Everett quantum multi-universe: other 
branches of the wave function. 


7. Holographic projections (currently a trendy 
proposal in cosmology). 


The universe is a computer simulation. 


9. All that can exist must exist—the “grandest 
of all multiverses’, the separate universes be- 
ing totally disjoint from each other. 


Now one thing is clear - they can’t all be true, for 
they conflict with each other. There remains the 
final possibility: 


10. Maybe none of them is true - there is just 
one universe. 


I will concentrate on the most popular one: cha- 
otic inflation (2), usually coupled with the land- 
scape of string theory (5). I will show firstly that 
there is no way to directly verify that this model 
is true, and secondly that it is not based in well 
understood and verified physics. Hence while it 
may possibly be true, it has not been proved so, 
and indeed that proof may well be impossible. 
The reason we can't prove a multiverse exists ob- 
servationally is due to the nature of its spacetime 
structure, which on a large scale is governed by 


Order of the smallest non-abelian simple group. 
Number of minutes in one hour and one degree. 
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Einsteins General Relativity Theory. When we 
model the large scale structure of the universe, 
our cosmological models are surprisingly simple: 
they assume a basic structure that is both spatial- 
ly homogeneous (all physical quantities are the 
same everywhere at the same cosmic time) and 
spatially isotropic (there are no preferred direc- 
tions in the sky when we average matter on large 
enough scales). This geometry is represented by 
the metric of the spacetime (see Appendix), which 
has a scale factor a(t) representing the change of 
distance between galaxies with time, whose time 
evolution is determined by the Einstein’s gravita- 
tional field equations, depending on the matter 
and radiation content of the universe. The met- 
ric also determines the paths of photons through 
spacetime, and so in particular determines the 
size of the visual horizon as a function of cosmic 
time. 


To understand this properly one of course needs 
to contemplate the equations of the theory, given 
in the Appendix. However we can also under- 
stand its relevant properties straightforwardly 
from spacetime diagrams showing how causal 
relations work in these models. The way such 
diagrams work, and their relation to the under- 
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A Figure 2 Space Time Diagram - Comoving Distance and Conformal Time 


lying spacetime metric, is discussed in detail in 
my book with Ruth Williams (Ellis and Williams 
1995). The relevant details are as follows. 


Figure 1 is a space time diagram representing spa- 
tial distances and cosmic time correctly. Time is 
plotted vertically, and distance horizontally. The 
start of the universe is at t = 0; galaxy world lines 
diverge from each other since then. Our galaxy 
world lie is at the centre (r = 0); the present is 
labelled “here and now”. Our past light cone is 
marked in red; this is the path through spacetime 
of photons that are reaching us now. Going back 
into the past, it reaches a maximum radius and 
then contracts back to the big bang singularity; 
this is basically because gravity bends light - one 
of Albert Einstein’s major discoveries. 


Now the problem with that diagram is we can't 
see causal relations near the big bang very well. 
We can correct that by changing to stretched dis- 
tance and time coordinates, that transform the 
past light cones to lines at + 45° and matter world 
lines to vertical lines (Figure 2). This is allowed 
because Einstein’s theory allows the use of any co- 
ordinates whatever to represent a given spacetime 
(this is the principle of general covariance). The 


initial singularity - a point in Figure 1 - is then 
represented by the horizontal line at the bottom. 
One should note that this singularity is not part of 
spacetime - it is the boundary of spacetime. This 
diagram also shows something not represented 
in Figure 1: the dotted horizontal line just above 
the singularity. This represents the surface of last 
scattering (“LSS”), where matter and radiation 
decoupled from each other in the early universe. 
This is the furthest back that we can see, because 
the universe was opaque to radiation at earlier 
times. Hence any earlier physics - the way the 
universe was created, the subsequent inflationary 
era - is not visible to us (this is similar to the way 
the surface of the Sun hides its interior from us). 
Most importantly, whatever that earlier physics 
was does not affect light propagation since decou- 
pling of matter and radiation at the LSS: hence the 
causal limits on what we can see are unaffected by 
any such earlier physics. 


Now the key point is that there is a furthest set of 
matter we can see by any electromagnetic radia- 
tion whatever; its world lines are marked as the 
Visual Horizon on the right. It is the world lines 
of matter that pass through the intersection of our 
past light cone with the LSS. This matter emit- 


Three times Fortunate number. Maybe the largest prime 


dividing the product of the next two primes plus 1. 
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ted the cosmic blackbody background radiation 
detected by the WMAP satellite, so the famous 
microwave background anisotropy map (Figure 
3) is just the image we get of this matter at the LSS. 
Any further out matter cannot be seen or detected 
by us by means of any radiation whatever (assum- 
ing no radiation moves faster than light - a key 
feature of relativity theory). The causal horizon 
(marked here as “present day horizon”) lies fur- 
ther out, and depends on early physics. 


In order to get a better picture of our observa- 
tional limits, we need to step back a bit and see 
the bigger context for Figure 2, depicted in Figure 
4: the whole of Figure 2 is the left hand triangle 
there. We cannot detect matter outside there by 
any means whatever. Hence we have no means 
of telling what conditions are like inside the pre- 
sumed universe domain on the right (the same 
size as our entire visible universe domain on the 
left). Physics there might be the same as here, or it 
might be totally different. There is simply no way 
we can ever find out. 


So here is the basic problem for multiverse propo- 
nents: no observational data whatever are avail- 
able to verify their claims of distant universe do- 
mains out there with different physics than in our 
domain. If the basis of science is verifying theo- 
ries by observation, then multiverse theories are 
not science. The assumption made in those theo- 
ries is that we that can extrapolate to 100 Hubble 
radii, 101 000 Hubble radii, or much much more 
(the word ‘infinity’ is casually bandied about) to 
determine in broad terms what conditions are 
like there. That’s not testable science. 


But there is another line of argument. Maybe one 
can justify the multiverse assumption if is a neces- 
sary outcome of known and tested physics, even if 
one cannot directly verify its existence. This is in- 
deed a sound line of reasoning. The problem with 
it is that several aspects of the physics supposed 
to lie the multiverse are hypothetical rather than 
well established: they are major extrapolations of 
known physics into the unknown, and those ex- 
trapolations may or may not be true. This issue is 
discussed in depth by Banks (2012), who shows 
quite clearly that none of the supposed physics 
(Coleman-de Luccia tunneling, the landscape of 
string theory, the supposed connection between 
chaotic inflation and string theory vacua) is well 
established physics. 


A third line of argument is that existence of a 
multiverse explains anthropic coincidences in 


ay ed <q Figure 3 The Cosmic Background Radiation 
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A Figure 4 The entire visible universe is a tiny fraction of the claimed multiverse. 
Most of its regions (if they exist) are not observationally accessible to us by any means. 


cosmology: why the universe is a suitable place 
for life to exist, in particular explaining the value 
of the cosmological constant (the “dark energy” 
currently causing the universe to accelerate). This 
case is made for example by Martin Rees (1999, 
2001), see also Carr (2009). Now it does indeed 
provide such an explanation. Does this therefore 
justify belief in a multiverse? Yes if you think 
theory trumps observational testing in a scientific 
theory; no otherwise. Key philosophical issues 
about the nature of scientific theories underlie 
this choice; a discussion is given in Ellis (2006). 


There are however two exceptions to this gloomy 
picture re testability of the multiverse idea. The 
first is the possible existence of “small universes”: 
universes where the spatial sections are spatially 
closed on a scale smaller than that of the visual 
horizon (Lachieze-Ray and Luminet 1995). In 
that case the horizontal axis of Figure 2 would 
close on itself on a scale smaller than that of the 
visual horizon, and we would already have seen all 
the matter there is in the universe, thus disproving 
the multiverse hypothesis. This intriguing possi- 
bility can be tested in various ways, in particular 
by searching for identical circles of temperature 
fluctuations in the CMB sky. This search has so 
far proved unsuccessful: this remains a possibility, 
but is perhaps unlikely. 


The second exception would be if there were colli- 
sions between different bubbles in the multiverse, 
resulting in detectable disk-like patterns in the CB 
sky. If such bubble collisions were detected and 
additionally could be associated with a variation 
of physics in the different bubbles, for example 
different values of the fine structure constant, this 
could legitimately be taken as vindication of the 
physics supposed to underlie the multiverse pro- 
posal. This has so far not been observed. 


A final comment relates to the issue of infinities. 
It is often said that infinities of universes occur in 
the multiverse (see for example Vilenkin 2007). 
This is a very dubious claim. Firstly, David Hilbert 
has stated “the infinite is nowhere to be found in 


reality, no matter what experiences, observations, 
and knowledge are appealed to.” (Hilbert 1964). I 
strongly concur. Secondly, in any case such claims 
are not verifiable, for there is no possibility what- 
ever of verifying them (no matter how many en- 
tities you have counted, you have not proved an 
infinity exists). If science is to do with testable 
claims, then any such claims are not science. 


For other motivations for the multiverse, argu- 
ments in its favor, and counterarguments, see my 
Scientific American article (Ellis 2011), the book 
edited by Carr (2009), and Kragh (2012). 
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YI Smallest number with 7 divisors. Index of Graham's 
number in the sequence 3, 27, 7625597484987, ... 


Primes and Particles 


Jack Williams, Clare 


he long-studied and incredibly elusive Rie- 

mann zeta-function has baffled number 

theorists for centuries. Deeply connected to 
the prime numbers and their distribution among 
the integers, its importance in number theory is 
well known. The Riemann hypothesis is perhaps 
the most famous unsolved problem in mathemat- 
ics, not least because the Clay Mathematics Insti- 
tute has offered $1 million for a solution. 


However, interest in this mysterious function 
extends far beyond esoteric results in number 
theory. With implications for physics, probability 
and statistics, it is more than just a number theo- 
retic curiosity. The underlying distribution of the 
zeros along the critical line R(s) = ; penetrates 
many branches of mathematics and there has 
been growing interest in the obscure connections 
it reveals between these fields. 

When defined as 

1 
n 


t(s) = » 


for R(s) > 1, it seems unconnected to the primes. 
But Euler first noticed the factorisation 


1 1 
+ 3a 60 


which holds by the fundamental theorem of arith- 
metic. Summing the geometric series yields 


(s)= TT — 


p prime 1-p* 


This factorisation, which can be made rigorous, 
reveals a deep connection with the primes. 


As an illustration of the zeta-function’s role in 
number theory, we give simple proof of the famil- 
iar Euclidean theorem. 


Theorem There are infinitely many primes. 


Proof: Suppose not. Then taking s —~ 1* in the 
identity, 


= 1 1 
2 et er 
gives 
a1 1 
aes 7 IT yl 
n=l p prime P 


which must be finite because there are only finite- 
ly many terms in the right hand side. But since the 
harmonic series diverges, this is a contradiction. 


Scattered recklessly throughout the integers, the 
primes have been studied for thousands of years. 
Gauss’ prime number theorem states that 

n 


In(n) 


where 7() is the number of primes less than or 
equal to n. This means that the probability that is 
prime is roughly 757; for large n. Here n need not 
be very large for a good approximation to 7(n). 
For n = 100, it estimates about 21 primes when the 
correct value is 25. 


m(n) ~ 


Smallest number that becomes a square when its 
reverse is either added to or subtracted from it. 
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Probability Density 
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In 1859, Riemann published another link between 
the zeta-function and the primes, giving an exact 
formula for 2(”), which improved on the prime 
number theorem. Riemann’s formula uses the 
location of zeroes in the analytic continuation 
of the Riemann zeta-function. Number theorists 
are interested in it because the distribution of ze- 
roes gives information about the distribution of 
primes. 


While prime numbers are sparser among the larg- 
er integers, the Riemann zeroes become denser 
further from the real line. It is possible to show 


that 
Tln(T) 


20 

where n(T) =|{ye R: ¢(4 + yi) =0,0<y< T}I. 
However, these zeroes are not arranged regularly 
along the critical line and the finer detail has been 
studied in more depth. 


n(T) ~ 


One way to get information about this distribu- 
tion is to study a spacing distribution, the dis- 
tribution of the distances between consecutive 
zeroes. These distances can be normalised to ac- 
count for the ‘global’ distribution given by n(T). 
The correct normalisation is )j = Yin) which 
results in the following distribution, estimated 
numerically using zeroes numbered 107! + 1 to 
107! + 10000. 


Surprisingly, this distribution arises naturally in 
other areas of mathematics and physics and seems 
to be fundamental to many seemingly unrelated 
problems. Experiments have been performed in 
which neutrons are scattered off a heavy nucleus. 
The resulting cross-section contains peaks, the 
scattering resonances, and troughs. If a neutron’s 
energy is near to one of the peaks, it is repelled by 
the nucleus and if it is near a trough, it can pass 


Largest number below 2000 not 
containing an ‘e’ (an eban number). 


<4 Probability density 
function for the normalised 
spacing between consecu- 
tive zeroes of the Riemann 
zeta-function 


0.25 
Normalised Spacing between consecutive zeros 


0.30 


through effortlessly. This strange behaviour leads 
to different scatterings. Although these problems 
are too difficult to solve analytically or numeri- 
cally, empirical data can be obtained and a similar 
analysis to the separation of the zeroes can be per- 
formed on these peaks. Astonishingly, the proba- 
bility density matches that of the Riemann zeroes. 
In fact, the density does not depend greatly on the 
particular nucleus being used. This remarkable 
connection is still poorly understood. 


The mathematical structure which models the 
scattering resonances is also quite unexpected. In 
the 1950s, Eugene Wigner proposed a statistical 
model based on random matrices. Inspired by 
Heisenberg’s formulation of quantum mechanics, 
in which properties of an atom or a nucleus can be 
described by a Hermitian matrix, he put forward 
the random-matrix conjecture. He suggested that 
these peaks follow the same spacing as eigenval- 
ues of Hermitian matrices whose elements are 
chosen from some probability distribution, usu- 
ally normal. The eigenvalues of the matrix are cor- 
respond to energy levels of the spectrum. Using 
such random matrices, one can obtain spacings 
that are statistically similar to both the neutron 
scattering data and the distribution of zeroes of 
the Riemann zeta-function. 


Although it is possible that the zeroes play some 
direct role in physical systems, the unexpected 
connection between the primes, matrices and 
neutron scattering may reflect a deeper result, 
uniting many seemingly disjoint areas of math- 
ematics. There has been progress in this direc- 
tion. Theorems have been proved showing that 
there is a single limiting distribution for the ei- 
genvalue spectrum arising from a large set of ran- 
dom matrices. Similar in spirit to the central limit 
theorem, these results mirror the type universality 
intrinsic to results which connect many areas of 
mathematics. 


Irregular and lucky and Pillai prime. Maximum 
number of letters in website domains. 
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T his will not be an article about mathematics 
or theoretical physics but more about why 
we carry out such activities and the com- 
mon ground mathematicians share with those 
in the arts. When we look at the long history of 
mathematics there is a deep aesthetic sensibility 
present in the writings of mathematicians. Com- 
mon references to beauty, symmetry emerge as 
almost a guiding tool. After all, mathematics is 
not just about what is true but what is interesting. 

What is interesting and beautiful seems somewhat 
subjective and yet when faced with the profound 
beauty of any number of mathematical theorems 
(the reader should think of her favourite theo- 
rem at this point) then it just seems so blindingly 
obvious that no explanation is ever needed. I am 
a theoretical physicist and I remember still the 
spine tingling excitement at first seeing Noether’s 
theorem. This link between the symmetries of the 
laws of nature and conserved quantities is to me 


In anormal distribution, 68% of values are 
within one standard deviation of the mean. 


an example of the irresistible beauty that math- 
ematics can uncover in nature. 


A typical example of the aesthetic raptures of 
mathematicians is found in Poincare’s famous 
quote, “The scientist does not study nature be- 
cause it is useful. He studies it because he delights 
in it and delights in it because it is beautiful...I do 
not talk here of the type of beauty that strikes the 
senses but a profound beauty that comes from the 
harmonious order of the parts.” 


Poincare’ “harmonious order of the parts” is 
somehow the mathematical beauty we know and 
love. His “beauty that strikes the senses” is I sup- 
pose a reference to more common ideas of beauty 
present in the visual arts. And yet are the two so 
necessarily different or can one reflect the other 
though perhaps with the inevitable loss of an im- 
perfect reflection. 


When we look at some of the goals and ideas of 
artists there is often a theme uncovering different 
“ways of seeing” and of challenging our everyday 
view of the world. (This is a quick and perhaps 
over easy generalisation but let me continue). In 
the 20th century, the artist followed many of the 
revolutions of 20th century physics with a remov- 
al of direct representation and a strong foray into 
abstraction. In Mondrian’s seminal essay, “Natu- 
ral reality and abstract reality’, the artist puts 
forward a manifesto justifying the abstraction of 
universals from nature, concentrating on essential 
building blocks and looking at structural connec- 
tions while ignoring the detail present in the in- 
dividual objects to bring out the common, shared 
structures in nature. His essay is comfortable 
reading for a contemporary theoretical physicist. 


And so in goals and in a very human belief in aes- 
thetics perhaps the differences are not so great af- 
ter all. The language of mathematics is where bar- 
rier to communication lies. Its enormous power 
prevents the concepts from being accessible or 
visceral. The challenge then is to capture some- 
thing of an idea or concept in physical work. 


Duality is a key idea in mathematics and is now 
at the centre of the frontier in theoretical phys- 
ics, string and M-theory. (One can argue whether 
duality really means the same thing in the context 
of mathematics and in theoretical physics but we 
will pass over this). 


< “Generalised Geometry 1&2”, Berman and Davey 
A “125 GeV" in wood, Berman and Davey 


In string theory, there is a duality symmetry 
known as T-duality. This is a fundamental ambi- 
guity in the description of the space time back- 
ground in which the string lives. If the space time 
has some specific properties (technically, it should 
possess an isometry and be compact so that its 
first homotopy class is nontrivial) then there will 
be two backgrounds that will be related to each 
other that in ordinary differential geometry will 
be inequivalent and yet will be indistinguishable 
from the point of view of the string. These pairs 
are known as T-duals. This duality is stringy in 
nature and leads on to the idea of stringy geome- 
try that differs from our usual notion of geometry 
in that such ideas of T-duality get built in. 


The Turner prize winning artist Grenville Davey, 
has spent many years working on sculptures that 
should be seen in isolation but as objects that 
bring out relationships and symmetries. They are 
collections of objects which when brought togeth- 
er show a relation. 


Together Grenville and I have been exploring 
ways to have a sculptural manifestation of some of 
the ideas in theoretical physics such as T-duality 
and spontaneous symmetry breaking. The goal is 
not to explain or exemplify but simply to inspire. 


Its square and its cube together contain every digit 
exactly once. Largest factorial on many calculators. 
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Computer representations of T-dual manifolds 

though perhaps accurate are not what we were af- 
ter. Instead the works were to be influenced by the 

mathematics rather than represent the mathemat- 
ics in some faithful way. This allowed the process 

to be free and in the end driven by the detailed 

aesthetics of the pieces themselves than by faith- 
fulness to an idea. The result has been a series of 
sculptures that have been shown in various gal- 
lery spaces but also in the Isaac Newton Institute 

for mathematical sciences. 


A "125 Gev"”, Berman and Daveyy 
Vv “Generalised Geometry”, Berman and Davey 


Who knows whether there is anything of T-dual- 
ity or symmetry breaking in these works. What 
is interesting is the process by which the pieces 
came about and the fact that some very abstract 
mathematics has given rise to some pieces of 
sculpture and influenced the mind of a leading 
British sculptor. 


One of only two to be both Pell number 
and a generalized heptagonal number. 


Algebraic degree of Conway's constant. Largest 
prime factor of the order of a sporadic simple group. 
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| limate change is now widely expected to 

( cause significant changes to conditions on 
Earth in the next century, with our actions 
playing a key role in determining what happens 
next. One of the less well understood effects is sea 
level rise. This will likely be dominated by glacier 
retreat in Antarctica and Greenland. 


It is these glaciers in particular that we attempted 
to understand this summer. Our method was to 
use a laboratory model for ice, which captures a 
hitherto neglected but we believe critical aspect: 
changes in viscosity. With a high rate of shear 
(velocity gradient), there is increased melting be- 
tween adjacent crystals. This leads to a reduction 
in viscosity. 


Our laboratory model for ice was Xanthan, a 
shear-thinning biological polymer. We started 
by considering the ice shelf, believing (or hoping 
for) the sheet to have been solved for already, as 
a viscous gravity current. The situation without 


Sluice 
Pump 


Weir 
y Side View 


Salt Water 


sidewalls was simple enough for us to solve it 
analytically without experiments to guide us. We 
found that the front goes as a power law in time 
(it accelerates). The other key result was that, for 
a fixed source thickness and fixed entry flux, the 
thickness at any location does not change (once 
the front reaches it). However, the source thick- 
ness remained a mystery. The width is still un- 
known (though it increases if the flux is higher). 


Happy with this very early (partial) breakthrough 
(the first real one in our careers), we then at- 
tempted to understand the effect of sidewalls. The 
motivation was ice shelves inside canyons, which 
is not at all unusual. Also common is slowly flow- 
ing pack ice completely filling a bay, leading to 
some friction at the edges. A typical experiment 
is shown in operating configuration, near the end 
of a run. 


Almost everything was made inside the workshop 
of the GK Batchelor Laboratory. We pioneered a 


Sea Level 
Control 


< Figure 1 The basic experimental 

setup. The flow is from left to right — 

\ under the sluice, over the weir, into 

| the ocean. This makes it more uniform 
across the channel - essential if we 
want good sidewall contact. 
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A Figure 2 Shown here is the ratio of the frac- 
tional change in the front position to the fractional 
change in time, over a short period. Note that the 
tank is 15cm wide, so power-law behaviour starts at 
approximately three times this. 


sea level control system, to stop large rises in sea 
level as Xanthan enters the ocean. Note the mini- 
mal drop in Xanthan from the weir into the ocean 
— this is because the sea level is < 1 mm below the 
top of the weir. Without removing saltwater from 
the ocean, wed be forced to accept a 2 cm drop, 
potentially affecting the shelf over a large region. 
Also, conditions would alter significantly during 
the experiment (as the 2 cm gap goes down to 0). 
Fortunately, our control system was able to hold 
the sea level constant to within 1 mm. 


Although there were other problems (like 3 specks 
of rust on the weir ruining 4 runs, until we real- 
ised); we mention the sea level in particular as 
we designed the system ourselves and because it 
wasnt controlled at all in a previous study, leading 
to it being fundamentally flawed. 


The key parameters (front position and peak 
thickness, near the source); are clearly visible in 
Figure 1. We took a photo like this every second. 
We speeded up the data analysis by using Matlab 
to trace the outlines (it’s 2012, this sort of thing is 
easy). What we were looking for were clues - in 
particular, if the front was going as a power law in 
time. But there was another major problem, and 
our apparatus wasn't to blame. 


The Xanthan was at a 1% concentration — this was 
too viscous. It didn't really have enough time to 
spread laterally, so the thickness at the walls was 
low, even in the best experiments. This meant that 
sidewall friction might not be dominating the sys- 
tem (compared to water pressure). 


log( front position), rescaledd 
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A Figure 3 Front position plotted against time on 
logarithmic axes, with rescaling according to chang- 
es in flux and other parameters. The red band is our 
theory, allowing for errors. The blue and green data 
points are from experiments set up slightly differ- 
ently, while the black ones are considered unreliable. 


Although we got a tentative indication of power 
law-type behaviour, we weren't happy and re- 
duced the concentration to 0.5% after a quick 
test. Then we had a real Eureka moment, though I 
didn't fully appreciate what it meant (unlike some 
of the people on the team). As can be seen below, 
the behaviour did indeed appear to converge to a 
power law. The uptick at the end is due to seawa- 
ter extraction sucking the shelf with it (a little). As 
the front slows down, the flat region on the graph 
below actually lasts for 100 seconds (in a 250s ex- 
periment). 


This was the defining moment of our project. We 
rapidly found more experiments indicating simi- 
lar behaviour. Then, the quest was on to explain 
it - everyone was more confident than me that we 
would now be able to succeed for sure. The good 
thing was that the value of 0.6 above is only a bit 
different to what would have happened if xanthan 
was a Newtonian fluid (value = 0.67). However, I 
didn't share the enthusiasm that the shear-thin- 
ning nature of xanthan was having only a minor 
effect - maybe on the numbers, but I thought it 
made things fundamentally different. 


We thought about it very carefully and eventually 
realised that, amazingly, the behaviour is indeed 
similar to the Newtonian case (where the flow 
is essentially a Poiseuille flow, with gradients in 
thickness driving it). Once the shelf is very long so 
sidewall friction dominates, then for our case, we 
get what we termed a generalised Poiseuille flow. 
The velocity is still polynomial in lateral position, 
but (for a shear-thinning fluid) there's a sharper 
edge (i.e. the power is more than 2). 
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Speed 


0 200 400 600 800 
Position across channel 


A Figure 4 Speed asa function of position across 
the channel. The theoretical curve is the smooth 
blue one. Raw data is in red, but errors are present 
so any value within the green curves is consistent 
with the data. Note that the tank is 15cm wide. 


We didn't see Figure 3 straight away, of course. It 
was quite something to see it slowly emerge — we 
didn't have the theory fully worked out until half 
the experiments were done. What was especially 
crucial was that we had the gradient measured 
before there was a theory, and the gradient told 
us how the frictional force scales with veloc- 
ity. Knowing there was viscous drag in a narrow 
boundary layer near the walls, this measurement 
meant we realised within days what was going on, 
allowing the theory to be developed. This would 
have taken much longer without the data. 


We also tracked particles that we put into the 
Xanthan. The results are shown below. The lateral 
velocity profile agreed very well with our model. 
The (peak) velocity where the PIV (particle track- 
ing) was done is 0.246 + 0.010 cm/s. 


The speed of the front at the same time was 0.27 
+ 0.01 cm/s. We expected the latter to exceed the 
former by 11%. The key thing is that such varia- 
tions lead to forces other than from sidewalls, but 
such forces are not important for long enough 
shelves (we hoped). Proving that such velocity 
variations are small meant such forces were small 
and so our model was fundamentally correct (i.e. 
it got the dominant force balance). We also com- 
pared the thickness profile along the channel with 
our prediction (that it’s nearly a perfect triangle). 
This also showed very good agreement. 


We then turned our attention to better under- 
standing ice tongues (i.e. without sidewalls). The 
above velocity profile would become uniform 
across the channel. 


There are 74 different non-Hamiltonian 
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A Figure 5 Side view of our shelf. Theory predicts 
the apparent discrepancy in the middle, where the 
gradient exceeds that near the source (right) — this 
should cause a variation in velocity along the tank. 
A Figure6 Here we zoom into the region near 
the front (left) in Figure 5, which should behave 

as if there are no sidewalls. (Thus, the thickness 
should not vary.) A black horizontal line is drawn on. 
Although this is a small length of shelf, the discrep- 
ancy with the sloped red line is obvious. Note that 
there are seeds in the Xanthan (for PIV). 


We expected the grounding line to stabilise, be- 
cause like any floating object there should be 
some equilibrium. In our model, there's no ten- 
dency to thicken once some sort of equilibrium 
is attained. Based on the extended flatline at 1 on 
Figure 2 and our theory, we expected the front to 
go linearly with time for the whole experiment 
(no drag to slow it down)! The acceleration inher- 
ent in our theory (described previously) was pre- 
dicted to be tiny in shelves of this length. Using 
the idea of forces along the channel balancing at 
the grounding line and using our understanding 
of what the forces are in the shelf (plus the vis- 
cous gravity current theory to determine this in 
the sheet), we created a computer model to pre- 
dict the grounding line thickness. 


We had previously done experiments without 
sidewalls. These were the most accurate ones we 
did, because we intended a 9 cm wide shelf in a 
15 cm wide tank not to hit the sidewalls in a tank 
90 cm long. The tank was levelled to within 2 arc- 
minutes, so we succeeded. We quickly realised 
that the front speed was constant! But, how thick 
was the shelf? Of course, at constant Q and con- 
stant front speed (and constant width, see photo- 
graph) the thickness of the whole shelf had to be 
constant. Photographs (not shown) revealed no 
surprise. But, we hadn't designed the experiments 
to measure the grounding line thickness, which 
was quite low (1 cm at most). 


In the end, we knew Q was constant and so was the 
front speed, and the width and height appeared 
unchanging in time and space. This allowed us to 
use flux conservation plus front speed and width 
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< Figure 7 The situation 
when there is a grounding 
line. Note that the tapering 
of the shelf occurs (in a 1m 
tank) only if there is contact 
with the sidewalls. 


Sloped Bed 


measurements to infer the thickness of the shelf 


(we always knew Q extremely precisely). To check, 
we counted pixels in the side view, but the very 
low thickness and shadows etc. meant this was 
inaccurate. However, it revealed consistency with 
the indirect measurements outlined above. These 
agreed very closely with our computer model! 


Next, we looked at an interesting effect (see 
comparison of real and laboratory models of ice 
tongues). This is due to a variable width at the 
grounding line, likely caused by a variable entry 
flux. After the grounding line, the shelf moves as 
a solid body. For a real glacier, Q oscillates annu- 
ally but for us it oscillated due to the action of our 
pump (which is peristaltic). We still don’t know 
precisely how Q affects the width, but the tapering 
at the front (when Q was rising from 0 to its final 
value, as the experiment had only just started); 
indicates that greater fluxes lead to a wider shelf. 


The effort to understand more is still underway, 
but our involvement in it is likely over with the 


completion of this project. The end result is (this 
time) fewer questions than we started with, be- 
cause some have definitely been answered, in- 
cluding the most crucial one — what's the domi- 
nant force balance (mathematically)? 


If anyone wants to know what it took to get this 
far, basically it’s determination and hard work. If 
we could see how to do something but it would 
be very hard, then we would always remind our- 
selves that we're lucky - sometimes, you can only 
wish you know what to do. Also crucial to our 
success was not feeling tied to anything that any- 
body (however experienced) predicted about the 
situation (i.e. believing it must be that way before 
the data came in). Instead of blindly trusting any- 
body’s ideas, we believed in ‘going with the flow, 
being guided by data and intuition and above all 
else having an unquenchable confidence in our 
ability to make at least partial progress on the road 
to understanding glaciers, even if nothing made 
sense and experiments didn’t work (because who 
really knows what tomorrow will bring)? 
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Finding Order 
in Randomness 


Maithra Raghu, Trinity 


iven a complete graph on six vertices, 
G denoted K, (a graph where every vertex 

is connected to every other vertex); we 
colour each edge of the graph either red or blue. 
Can we find a complete graph on three vertices 
(aka a triangle) such that all its edges are the same 
colour? What about for a bi-coloured K,); can we 
find a monochromatic K,? 


In both cases, it is indeed possible. These prob- 
lems are an example of the finite case of a theorem 
in Ramsey Theory. 


Ramsey Theory is named after the British math- 
ematician FRANK PLUMPTON RAMSEY (1903 — 
1930) whose paper, On a Problem of Formal Logic 
(1928), proved what is now known as Ramsey’s 
Theorem. This was not the first theorem proved in 
the area of modern Ramsey Theory; Issa1 SCHUR 
proved in 1916 that there always exist monochro- 
matic x, y, z in a finite colouring of the naturals 
such that x + y = z and VAN DER WAERDEN his 
eponymous theorem in 1927. However, Ramsey’s 
work was imperative in ensuring that all these re- 
sults were viewed collectively under one area and 
encouraging further research in this field. 


We have mentioned some problems for colouring 
a finite set of points; but what kind of patterns 
emerge if we colour an infinite set of points? 


WNFC (When Naturals 
are Finitely Coloured) 


We start simply, assuming as before, that we only 
have two colours, blue and red, at our disposal. 


Smallest integer without unique or interesting 
properties worth mentioning here... 


We now consider colouring edges of the complete 
graph with N as its vertex set with these two col- 
ours. This is a daunting thing to imagine, so we 
shall introduce some (abuse of) notation to make 
things easier. 


Let N® denote distinct pairs of natural num- 
bers such that order does not matter i.e (a, b) = 
(b, a). Then our colouring is simply the function 
c:N® — {blue, red}. 


We can now picture our daunting infinite com- 
plete graph as below, a sequence with pairs of the 
sequence connected by either red or blue lines. So 
a monochromatic subgraph in this context is sim- 
ply a subset of N on which c is constant. 


We now claim that we can find a subset M of N, 
with 

1. M infinite 

2. M® monochromatic 


To prove this, we pick any natural number a,. All 
the lines coming out of a, are either red or blue. 
So there is some infinite subset B, of N - {aj} 
such that all the lines from a, to B, are of the same 
colour (pigeonhole two and &,). We then pick a, 
in B, and pick infinite B, such that all lines from 
a, to B, are monochromatic. We keep repeating 
to get a sequence dj, a5, a3 ,... with each a; having 
either red or blue associated with it. As there are 


only two colours, some colour occurs infinitely 
often and the terms associated with this colour 
give us our required set. 


An interesting diversion 


We can now prove that any sequence in a totally 
ordered set has a monotone subsequence. Let 
C(a;,a;) denote the colour of the edge (a; a;). 
Then 

1. C(a,aj)=red  ifa;> a; 

2. C(a;, aj) = blue ifa;< a;. 
But now we know we have a monochromatic set, 
which corresponds to a monotone sequence. 


This gives another way to prove the Bolzano- 
Weierstrass theorem! We let our totally ordered 
set be the reals, and we have proven that every 
bounded sequence has a monotone subsequence. 
But by the Fundamental Theorem for reals, any 
monotone sequence converges, so we have our 
convergent subsequence. 


And back to where we left... 


Two natural extensions follow: 


1. When two-colouring N for some finite r, 
can we find a subset M as before? 


2. What happens if we use some finite 
number k of colours instead? 


It is indeed possible to find a set M as before in 
both cases. 


For part 1, we proceed by induction. The case r= 1 
is an application of the Pigeonhole principle, and 
the case r = 2 is what we have just proven. So let us 
assume the result for r= k and consider r= k + 1. 
As before, we pick some a, in N. Now notice that 
this induces a two colouring on (N - {a,})". 
The diagram shows the case for r = 3: 


M 


a, 


And by our induction hypothesis, there is an in- 
finite monochromatic set M for this colouring, 
say colour red. But as the colouring was induced 
on our (r - 1)-tuples in M by removing, when we 
add {a,} back; we recover our r-tuple colouring, 
which has the same colouring as the (r- 1)-tuple. 


So M u {a,} provides the set we were looking for. 


And now for part 2, increasing the number of col- 
ours we can create disorder with. The case for col- 
ouring N® with some finite number k of colours 
turns out to be surprisingly easy, using the ideas 
of induction and “colour-blindness”. 


We induct on the number of colours k. We as- 
sume that we only have two colours, red and eve- 
rything-but-red. Then by our previous work, we 
know there is a monochromatic set M. If M is red, 
then we are done! If not, then we've just reduced 
to k -1 colours, which is soluble by the induction 
hypothesis. 


Infinite to Finite 


Before plunging back into the infinite, note that 
we can prove finite Ramsey from what we know 
of infinite Ramsey. We write [7] = {1, 2, 3, ..., n}. 


Theorem Let m,r¢ N. Then there exists n 
such that whenever [n]" is two coloured, there 
is a monochromatic set A c [1] of size m. 


The proof is left as an exercise. Try and construct 
a two colouring of N“ without a monochromatic 
M, providing the contradiction. 


The canonical Ramsey Theorem 


We have managed to avoid the most daunting 
question yet. What happens if we dare to colour 
N? with infinitely many colours? The question 
seems inane; what kind of pattern could we hope 


It is possible for a sudoku puzzle to have up to 
77 givens (out of 81), yet lack a unique solution. 
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A Possible Structures of M 


to find? A monochromatic set M is out of the 
question; we have the power to pathologically col- 
our every point in N? a different colour. Yet even 
in this melange of colourings there is some order. 


Theorem If we have an arbitrary colouring 
of N®), there exists an infinite M of one of the 
following forms, for arbitrary i< j<k< I: 


(i) Mis monochromatic; 

(ii) Each point in M® has 
a different colour; 

(iii) (4, j), (kD) in M™ have the 
same colour iff i = k; 

(iv) (i,j), (kD) in M@ have the 
same colour iff j =]; 


These four possibilities are shown in the diagrams 
above. 


Here we give a sketch proof of this theorem. In 
the proof, we are going to work with sets of the 
form A“ and colour with the colours of ‘same’ 
and ‘different’ (based on certain properties of our 
set), and then use our previous work to find sets 
monochromatic in ‘same’ or ‘different’ to filter out 
our desired properties. 


We first deal with the monochromatic case. 


First we two-colour N* by giving (i,j,k, 1) colour 
« ‘same if C(i,j) = C(k,]) in original colouring; 
¢ ‘different’ otherwise. 


By our previous work, there exists an infinite 
monochromatic set A,. If A; is the colour ‘same’ 
then it is monochromatic (exercise). So we as- 
sume A, is ‘different. 


Note that A, being ‘different’ implies that C(i,j) 
never equals C(j,k) in Aj, else we obtain a contra- 
diction (exercise). 


We define ‘left same’ to be if C(i,j) = C(i,k) and 
‘left different’ otherwise, and ‘right same’ as C(j, k) 
= C(i,k) and ‘ight different analogously. Now we 
two colour A, twice, to obtain A, and A; as fol- 
lows. We first find an A, monochromatic in ‘left 
same’ / ‘left different’ and from that form a mono- 


Smallest integer that can be written as the 
sum of four distinct squares in three ways. 
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chromatic A; in ‘right same’ / ‘right different. We 
now have various different cases: 


« IfA; is ‘left different’ and ‘right same’ then it 
is case (iv) from above. 


e If A, is ‘left same’ and ‘right different’ then it 
is case (iii) from above. 


¢ Note that A; cannot be right and left same, 
as it is a contradiction to A, being ‘different’ 


« Finally, we have the case where A; is dif- 
ferent in both sides. Note that we can find 
a subset A; such that A; is monochromati- 
cally different both regarding C(i,!) = C(j,k) 
as ‘same’ and C(i,k) = C(j,1) as ‘same. 


In each case, we can find a monochromatic ‘dif- 
ferent’ set M, as M being ‘same’ would result in a 
contradiction with A, being ‘different’. Indeed, if 
we found a ‘same’ M for C(i,l) = C(j,k), then pick- 
ingi<j<k<l<m<n, we have C(i,n) = C(j,k) 
and C(i,n) = C(I, m), so C(j,k) = C(L,,) in A,, which 


is a contradiction. 


This A, satisfies case (ii) from above, and we are 
done! 


In Conclusion 


In this article, we have focussed solely on Ramsey's 
Theorem. But modern Ramsey Theory extends far 
beyond this. Van der Waerden’s theorem looks at 
finding monochromatic arithmetic progressions. 
Though originally considered a cornerstone of 
Ramsey Theory, RICHARD Rapo’s astounding ex- 
tension of Schur’s Theorem, suitably called Rado’s 
Theorem, gives an immediate solution to Van der 
Waerden and all its extensions. HINDMAN’s Theo- 
rem (proved 1973) then extends Rados ideas of 
partition regularity to an infinite setting. Despite 
these leaps in understanding, we still lack basic 
information in many areas of Ramsey Theory. In- 
deed, even now, it is not entirely clear how Rado 
managed to conceive his ground breaking theo- 
rem. This absence of full understanding, com- 
bined with the relative clarity of the problems 
ensures that Ramsey Theory will be a fruitful and 
fascinating area of research in the future. 


Atomic number of Gold. A fortunate, Gaussian, 
happy, Higgs, Kynea, lucky, Pillai and regular prime. 
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Mathematics 


in Wartime 
G H Hardy 


First published in issue 3, 1940 


he editor asked me at the beginning of term 

to write an article for Eureka, and I felt 

that I ought to accept the invitation; but all 
the subjects which he suggested seemed to me at 
the time quite impossible. “My views about the 
Tripos” - I have never really been much interested 
in the Tripos since I was an undergraduate, and I 
am less interested in it now than ever before. “My 
reminiscences of Cambridge” - surely I have not 
yet come to that. Or, as he put it, “something more 
topical, something about mathematics and the 
war” — and that seemed to me the most impossible 
subject of all. I seemed to have nothing at all to say 
about the functions of mathematics in war, except 
that they filled me with intellectual contempt and 
moral disgust. 


Ihave changed my mind on second thoughts, and 
I select the subject which seemed to me originally 
the worst. Mathematics, even my sort of math- 
ematics, has its “uses” in war-time, and I suppose 
that I ought to have something to say about them; 
and if my opinions are incoherent or controver- 
sial, then perhaps so much the better, since other 
mathematicians may be led to reply. 


I had better say at once that by “mathematics” I 
mean real mathematics, the mathematics of Fer- 
mat and Euler and Gauss and Abel, and not the 
stuff which passes for mathematics in an engineer- 
ing laboratory. I am not thinking only of “pure” 
mathematics (though that is naturally my first 
concern); I count Maxwell and Einstein and Ed- 
dington and Dirac among “real” mathematicians. 


Days needed by Phileas Fogg to travel around the 
world. Number of columns in IBM punch cards. 


I am including the whole body of mathematical 
knowledge which has permanent aesthetic value, 
as for example, the best Greek mathematics has, 
the mathematics which is eternal because the 
best of it may, like the best literature, continue to 
cause intense emotional satisfaction to thousands 
of people after thousands of years. But I am not 
concerned with ballistics or aerodynamics, or any 
of the other mathematics which has been specially 
devised for war. That (whatever one may think of 
its purposes) is repulsively ugly and intolerably 
dull; even Littlewood could not make ballistics 
respectable, and if he could not, who can? 


Let us try then for a moment to dismiss these sin- 
ister by-products of mathematics and to fix our 
attention on the real thing. We have to consider 
whether real mathematics serves any purposes 
of importance in war, and whether any purposes 
which it serves are good or bad. Ought we to be 
glad or sorry, proud or ashamed, in war-time, that 
we are mathematicians? 


It is plain at any rate that the real mathematics 
(apart from the elements) has no direct utility in 
war. No one has yet found any war-like purpose 
to be served by the theory of numbers or relativity 
or quantum mechanics, and it seems very unlikely 
that anybody will do so for many years. And of 
that I am glad, but in saying so I may possibly en- 
courage a misconception. 


It is sometimes suggested that pure mathemati- 
cians glory in the “uselessness” of their subject, 
and make it a boast that it has no “practical” ap- 
plications. I have been accused of taking this view 


Godfrey Hardy (1877 - 1947) is not only famous > 
for his achievements in number theory and 
analysis, but for mentoring Srinivasa Ramanujan 
and for writing “A Mathematician’s Apology” 

on the aesthetics of mathematics. 


myself. I once stated in a lecture, which was after- 
wards printed, that “a science is said to be useful 
if its development tends to accentuate the existing 
inequalities in the distribution of wealth, or more 
directly promotes the destruction of human life”; 
and this sentence, written in 1915, was quoted in 
the Observer in 1939. It was, of course, a conscious 
rhetorical fluorish (though one perhaps excusable 
at the time when it was written). 


‘The imputation is usually based on an incautious 
saying attributed to Gauss, to the effect that, if 
mathematics is the queen of the sciences, then 
the theory of numbers is, because of its supreme 
“uselessness, the queen of mathematics, which 
has always seemed to me to have been rather 
crudely misinterpreted. If the theory of numbers 
could be employed for any practical and honour- 
able purpose, if it could be turned directly to the 
furtherance of human happiness or the relief of 
human suffering (as for example physiology and 
even chemistry can), then surely neither Gauss 
nor any other mathematician would have been so 
foolish as to decry or regret such applications. But 
if on the other hand the applications of science 
have made, on the whole, at least as much for evil 
as for good - and this is a view which must al- 
ways be taken seriously, and most of all in time of 
war — then both Gauss and lesser mathematicians 
are justified in rejoicing that there is one science 
at any rate whose very remoteness from ordinary 
human activities should keep it gentle and clean. 


It would be pleasant to think that this was the 
end of the matter, but we cannot get away from 
the mathematics of the workshops so easily. In- 


directly, we are responsible for its existence. The 
gunnery experts and aeroplane designers could 
not do their job without quite a lot of mathemati- 
cal training, and the best mathematical training 
is training in real mathematics. In this indirect 
way even the best mathematicians becomes im- 
portant in war-time, and mathematics are wanted 
for all sorts of purposes. Most of these purposes 
are ignoble and dreary - what could be more 
soul-destroying than the numerical solution of 
differential equations? - but the men chosen for 
them must be mathematicians and not laboratory 
hacks, if only because they are better trained and 
have the better brains. So mathematics is going 
to be really important now, whether we like it or 
regret it; and it is not so obvious as it might seem 
at first even that we ought to regret it, since that 
depends upon our general view of the effect of sci- 
ence on wat. 


There are two sharply contrasted views about 
modern “scientific” war. The first and the most 
obvious is that the effect of science on war is 
merely to magnify its horror, both by increasing 
the sufferings of the minority who have to fight 
and by extending them to other classes. This is 
the orthodox view, and it is plain that, if this view 
is just, then the only possible defence lies in the 
necessity for retaliation. But there is a very dif- 
ferent view which is also quite tenable. It can be 
maintained that modern warfare is less horrible 
than the warfare of pre-scientific times, so far at 
any rate as combatants are concerned; that bombs 
are probably more merciful than bayonets; that 
lachrymatory gas and mustard-gas are perhaps 
the most humane weapons yet devised by military 


Number of stable chemical elements. One of three 
numbers to be the square of the sum of its digits. 
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science, and that the “orthodox” view rests solely 
on loose-thinking sentimentalism. This is the 
case presented with so much force by Haldane in 
Callinicus. It may also be urged that the equalisa- 
tion of risks which science was expected to bring 
would be in the long run salutary; that a civilian’s 
life is not worth more than a soldier’s, or a wom- 
an’s than a man’s; that anything is better than the 
concentration of savagery on one particular class; 
and that, in short, the sooner war comes “all out” 
the better. And if this be the right view, then sci- 
entists in general and mathematicians in particu- 
lar may have a little less cause to be ashamed of 
their profession. 


It is very difficult to strike a balance between 
these extreme opinions, and I will not try to do 
so. I will end by pulling to myself, as I think every 
mathematician ought to, what is perhaps an easier 
question. Are there any senses in which we can 
say, with any real confidence, that mathematics 
“does good” in war? I think I can see two (though I 
cannot pretend that I extract a great deal of com- 
fort from them). 


In the first place it is very probable that mathe- 
matics will save the lives of a certain number of 
young mathematicians, since their technical skill 
will be applied to “useful” purposes and will keep 


The sixth nuclear magic number. Number of 6-hexes 
(shapes made by joining six regular hexagons). 
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them from the front. “Conservation of ability 
is one of the official slogans; “ability” means, in 
practice, mathematical, physical, or chemical abil- 
ity; and if a few mathematicians are “conserved” 
then that is at any rate something gained. It may 
be a bit hard on the classics and historians and 
philosophers, whose chances of death are that lit- 
tle much increased; but nobody is going to worry 
about the “humanities” now. It is better that some 
should be saved, even if they are not necessarily 
the most worthy. 


Secondly, an older man may (if he not too old) 
find in mathematics an incomparable anodyne. 
For mathematics is, of all the arts and sciences, 
the most austere and the most remote, and a 
mathematician should be of all men the one who 
can most easily take refuge where, as Bertrand 
Russell says, “one at least of our nobler impulses 
can best escape from the dreary exile of the actual 
world.” But he must not be too old - it is a pity 
that it should be necessary to make this very seri- 
ous reservation. Mathematics is not a contempla- 
tive but a creative subject; no one can draw much 
consolation from it when he has lost the power or 
the desire to create; and that is apt to happen to a 
mathematician rather soon. It is a pity, but in that 
case he does not matter a great deal anyhow, and 
it would be silly to bother about him. 


© Foundation MathArt Koos Verhoeff 


A Mitered Fractal Tree 
Koos Verhoeff and Anton Bakker 


Atomic number of the heaviest stable element, 83 
Bismuth. Sophie Germain and a Eisenstein prime. 
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Consecutive Integers 


Paul Erdds 


First published in issue 38, 1975/76 


utive integers were settled. Catalan conjec- 

tured that 8 and 9 are the only consecutive 
powers. First of all observe that four consecutive 
integers cannot all be powers since one of them is 
congruent to 2 modulo 4. 


S ome time ago, two old problems on consec- 


It is considerably more difficult to show that three 
consecutive integers can not all be powers; this 
was accomplished about 20 years ago by CassELs 
and Maxowskl. Finally in 1974 using some deep 
results of BAKER, TIJDEMAN proved that there is 
an My, whose value can be given explicitly, such 
that for n > ny, n and n+ 1 are not both powers. 
This settles Catalan’s conjecture almost completely. 


It has been conjectured that if x, < x, <x3<... is 
a sequence of consecutive powers, such as x, = 1, 
x, = 4, ... then x;,; - x; > i° for all i and some 
suitable constant c. At the moment this seems in- 
tractable. 


It was conjectured more than a century ago that 
the product of consecutive integers is never a 
power. Almost 40 years ago, R1GcE and I proved 
that the product of consecutive integers is never a 
square, and recently SELFRIDGE and I proved the 
general conjecture. In fact, our result is that for 
every k and / there exists a prime p > k such that if 


k 
ep | [] (a +i) 
i=l 


then 
a, = 1 mod p. 


Number of 5-faces of a seven-dimensional hypercube. 
Largest order of a permutation of 14 elements. 


We conjecture that in fact for all k > 2 there is a 
prime p > k with a,) = 1, but this is also intractable 
at the moment. 


It often happens in number theory that every new 
result suggests many new questions — which is a 
good thing as it ensures that the supply of Math- 
ematics is inexhaustible! I would now turn to dis- 
cuss a few more problems and results on consecu- 
tive integers and in particular a simple conjecture 
of mine which is more than 25 years old. 


Put 
m = a,(m) b,(m), 
a,(m) = || p%, 


where the product extends over all the primes 
p>kand p“|m. Further define 


f(ns k,l) = min{a,(n+i):1<i<lI}; 
F(k, 1) = min{f(n;k, l):1<n< oo}. 


I conjectured that 


lim HEH) =0. 
k-00 k 


(1) 


In other words, is it true that for every «¢ there is 
a k, such that for every k > k, at least one of the 
integers a)(n + i) fori=1, ..., 1, is less than k,. 1 am 
unable to prove this but will outline the proof of 


F(k,k) <(1+e)k for k > ko(e). (2) 
To prove (2) consider 
A(n, k) = TI. ar(n +i) (3) 


Paul Erdds (1913 — 1996) has published more > 
papers than any other mathematician in history. 


where the [’ in (3) indicates that for every p < kwe 
omit one of the integers n + i divisible by a maxi- 
mal power of p. Then the product IT’ a,(n + i) has 
at least k — m(k) factors and by a simple applica- 
tion of the Legendre formula for the factorisation 
of k! we obtain 


T] a(n +i) |e! (4) 


If (2) did not hold, we have from (4) and Stirling’s 
formula 


(A+e)Ky™ <hhe+ 


or Kes eae, 


(5) 


Now, by the prime number theorem, 
(1+ €/10)k 


k 
anys logk 


and so from (5), 


(1+ €/10)k | | 2k 
k+( (sek ca) > eet «(hs a). 


which is false if k is large enough. This contradic- 
tion proves (2). 


Assume for the moment that (1) has been proved. 
Then one can immediately ask for the true order 
of magnitude of F(k, k). I expect that it is o(k*) 
for every € > 0. On the other hand, I can prove that 


_log (k) loglog log (k) (6) 
log log (k) 
The problem of estimating F(k, k) and the proof 


of (6) is connected with the following question 
on the sieve of Eratosthenes-Prim-Selberg : deter- 


F(k,k) > exp (: 


mine or estimate the smallest integer A(k) so that 
one can find, for every p with A(k) < p< k,a resi- 
due u, such that for every integer t < k, t satisfies 
one of the congruences to u, modulo p. Clearly 
F(k, k) ¢ A(k). Using the method of Rankin-Chen 
and myself I proved 


Ais (« log arse) (7) 


log (k) 


which implies 6. I do not give the proofs here. It 
would be interesting and useful to prove A(k) < k® 
for every € > 0 and sufficiently large k. 


Now, I shall say a few words about F(k, 1) for k # 1. 


It follows easily from the Chinese Remainder The- 
orem that for 1 < m(k) we have F(k, 1) = 00, since 
for a suitable n, we can make n + i for 1 < i < 2(k) 
divisible by an arbitrarily large power of p,. It is 
easy to see that this no longer holds for / = 1(k) +1 
and in fact it is not hard to prove that 


F(k, mk) +1) = |] p%, 


where p% < m(k) < p**!. As I increases it gets 
much harder to even estimate F(k, !). 


Many more problems can be formulated which 
I leave to the reader and only state one which is 
quite fundamental: Determine or estimate the 
least 1 = 1, so that F(k, I.) = 1. 


In other words, the least J, so that among /; con- 
secutive integers there is always one relatively 
prime to the primes less than k. This question is 
of course connected with the problem of estimat- 


Centred triangular, square, 14-gonal and 
8-gonal number. Number of ways to tie a tie. 
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ing the difference of consecutive primes and also 
with the following problem of Jacobsthal: Denote 
by g(m) the least integer so that any set of g(m) 
consecutive integers contains one which is rela- 
tively prime to m. At a Number Theory meeting 
in Oberwolfach (November °75), Kanold gave an 
interesting talk on g(m). Vaughan observed that 
the sieve of Rosser gives g(m) < (log m)?** for all 
é > 0 if m is sufficiently large. The true order of 
magnitude is not known. 


It seems to me that interesting and difficult prob- 
lems remain for 1 < 1(k) too. Here we have to con- 
sider the dependence on n too. It is not hard to 
show that for every ¢ > 0 there are infinitely many 
values of n for which 


f (ask, 1) > (l=). (8) 


The proof of (8) uses some elementary facts of 
Diophantine approximation and the Chinese Re- 
mainder Theorem. We do not give the details. I 
do not know how much (8) can be improved. By 
a deep theorem of Mahler, using the p-adic Thue- 
Siegel Theorem, f(n; k, 1) > n**/'. It is quite pos- 
sible that 


lim sup 


noo 


(9) 


Interesting problems can also be raised if k tends 
to infinity with n; e.g. how large can f(n; k, (k)) 
become if k = (1+ 0(1)) log n? It seems to be dif- 
ficult to write a really short note on the subject 
since new problems occur while one is writing! 


F(mk 1) 


It would be of some interest to know how many of 
the integers a,(n +i) must be different. I expect 
that more than c xk are. If this is proved one of 
course must determine the best possible value of c. 


Denote by K(1) the greatest integer below J com- 
posed entirely of primes below k. Trivially 
min max a(n + i) = K(Z). (10) 
1 
To prove (10) observe that on the one hand any 
set of / consecutive integers contains a multiple 
of K(1) on the other that if 2] divides t, then the 
integers t! +1, ..., t! +1 clearly satisfy (10), when 
n = 0. More generally, try to characterise the set 
of n which satisfy (10). To simplify matters, let 
k= 1 and denote n, as the smallest positive integer 
with max; a,(n + i) = k, S; as the class of all inte- 
gers n such that this is true. If p% is the greatest 
power of p not exceeding k then 


Can be written as 222 in base 6. Number of 
metals in the periodic table. Happy number. 


lie c Sk. 
psk 


Perhaps I am overlooking an obvious explicit con- 
struction for n, but at the moment I do not even 
have good upper or lower bounds for it. When 
is k! in S,? The smallest such k is 8 and I do not 
know if there are infinitely many such k’s. But by 
Wilson's theorem, p! is never in S,. 


To complete this note, I state three more extremal 
problems in number theory. Put 


n!=|Ja;, fora, <a, <...<a 


n* 


Determine max 4). It follows easily from Stirling's 
formula that a, does not exceed 7(1- ge a 
conjectured that for every 7 > 0 and sufficiently 


large n, max a, exceeds 7 (1 - 7). 


Now write 
n'=|[b,, forl<b, <b, <...<b, <n. 


Determine or estimate min k. 


Clearly k exceeds n — n/log n and by more com- 
plicated methods I can prove 


k=n-n(1+o(1))/log n, 
k>n-n(logn+c)/ (log n)’, 
where c is a positive absolute constant. 


Finally write 


(11) 


Determine or estimate min u,, but k is not fixed. 
It is not hard to prove that u, less than 2n has only 
a finite number of solutions. I only know of two: 


n! =| [u;, for u, < Uy <1. < Ug. 


6!=8x9x 10, 
14! = 16 x 21 x 22 x 24 x 25 x 26 x 27 x 28. 


It would be difficult to determine all the solutions, 
although Vaughan has found some more: 


3! = 6, 
8!=12 x 14x 15 x 16, 
15! = 16 x18 x20 x21 x22 x25 x26 x27 x28, 
and these are all up to 15. Vaughan also tells me 


40! = 42 x44 x45 x48 x49 x50 x51 x52 x 
54x55 x56 x57 x58 x59 x60 x62 x63 x 
64 x65 x66 x68 x69 x72 x74 x80. 


The Ultimate Painting 
Drop Artists, 1966 


Sum of the squares of the first four primes. 87 
Sum of the divisors of the first ten integers. 


:' rchimedes was the greatest mathemati- 
/ cian, possibly the greatest scientist and 


certainly one of the greatest engineers of 
antiquity. Plutarch writes that although his en- 
gineering achievements ‘gave him the renown of 
more than human sagacity, ... he placed his whole 
affection and ambition in those purer speculations 
where there can be no reference to the vulgar needs 
of life; studies, the superiority of which to all oth- 
ers is unquestioned, and in which the only doubt 
can be, whether the beauty and grandeur of the 
subjects examined, or the precision and cogency of 
the methods and means of proof, most deserve our 
admiration. 


Archimedes’ final triumph as an engineer was the 
defence of Syracuse (in 212 BC) when ‘such terror 
seized the Romans, that, if they did but see a lit- 
tle rope or a piece of wood from the wall, instantly 
crying out, that there it was again, Archimedes was 
about to let fly some engine at them, they turned 
their backs and fled’. However, the Romans even- 
tually prevailed and he died in the sack of the 
city. The Romans, who organised the destruction 
of more people of every race, religion and col- 
our than any empire before, were always a little 
ashamed of killing the greatest mind of antiquity 
and invented several fine stories about his death. 


The writings of Archimedes were collected, cop- 
ied and expounded for the next 1500 years but, 
although some of those who studied them cer- 
tainly understood them, they do not seem to 
have progressed much beyond him. One reason 
for this may have been expressed by Cicero, who 
was proud of restoring the tomb of the great man. 


ei te ee 


Numbers of keys on a piano and constellations . 


in the sky. Days in a year on Mercury. 
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‘Among them [the Greeks] geometry was held in 
highest honour; nothing was more glorious than 
mathematics. But we [the Romans] have limited the 
usefulness of this art to measuring and calculating’ 
(Or, as EPSRC might put it, shaping capability’) It 
is perhaps, not surprising that, although the Ro- 
mans produced much bigger and somewhat better 
versions of existing technologies, they produced 
little that was entirely novel. Another reason for 
the lack of progress may have been the channel- 
ling of Greek abstract thought into the of endless 
marshes of Christian theological controversy. 


The fall of the Eastern Roman Empire, ending 
with the sack of Constantinople in 1492, result- 
ed in the loss of an enormous number of Greek 
manuscripts. Which books survived seems to 
have been mainly a matter of luck. Some of Ar- 
chimedes’ works survived as earlier translations 
into Arabic, but most of what survived was in two 
manuscripts which found their way into the pos- 
session of the Norman kings of the Two Sicily’s 
and then into the Vatican library. Both have since 
disappeared, but not before they were translated 
into Latin first by William of Markab in 1296 and 
then by James of Crewman in 1544. 


The invention of printing meant that Crewman’s 
translation could be widely distributed and Archi- 
medes became a hero and a source of inspiration 
to early scientists like Kepler and Galileo. Archi- 
medes showed them that mathematics could be 
used not merely to study the heavens (which had 
always had an ethereal and so mathematical feel) 
but everyday things like boats floating in water. 
Newton wrote his Principia in the Greek (that is 


a 


F 


to say, the Archimedian) style, making a difficult 
book even more difficult, but showing the respect 
due from one mage to another. 


The new calculus of Newton and Leibniz meant 
that any fool (measured on the Newton and Ar- 
chimedes scale) could find results which up then 
had required the genius of Archimedes and his 
direct influence on science came to an end. 


In 1906, a Danish classical scholar named Heiberg 
realised that a prayer book held in Istanbul was 
written on reused vellum and that the original text 
(which had, of course, been carefully scraped off) 
was a collection of Archimedean works. Working 
from photographs, he was able to recover most of 
this text. Several of the works were known in one 
form or another but one now called The Method of 
Mechanical Theorems created a sensation. 


It begins with words that still thrill me many years 
after I first read them: Archimedes to Eratosthenes 
greeting. [...] Seeing moreover in you, as I say, an 
earnest student, a man of considerable eminence 
in philosophy, and an admirer [of mathematical 
enquiry], I thought fit to write out for you [...] the 
peculiarity of a certain method, by which it will be 
possible [...] to investigate some of the problems in 
mathematics by means of mechanics. This proce- 


dure is, I am persuaded, no less useful even for the 
proof of the theorems themselves; for certain things 
first became clear to me by a mechanical method, 
although they had to be demonstrated by geometry 
afterwards because their investigation by the said 
method did not furnish an actual demonstration’ 


In other words, the great magician will draw back 
the curtain and reveal his secrets. And those se- 
crets turn out to be tremendous — not quite the 
modern calculus of Newton and Leibniz but cer- 
tainly containing many of the ideas, painfully 
discovered by their predecessors, which underlie 
that calculus. Alternative history is a mug’s game, 
but it is hard not to feel that, if Galileo or Kepler 
had held The Method in their hands, Western 
science would have been advanced by fifty years. 
Archimedes concluded his introduction with the 
words. ‘I am persuaded that [this method] will be 
of no little service to mathematics; for I apprehend 
that some, either of my contemporaries or of my 
successors, will, by means of the method when once 
established, be able to discover other theorems in 
addition, which have not yet occurred to me. But 
it was not to be. 


Naturally scholars returned to Istanbul to look at 
the original prayerbook, only to find it had disap- 
peared! 


In 1998 it reappeared, further damaged, in part, by neglect and, in part, 
= by a criminal attempt at forgery, and was sold at auction to an anony- 
ARCHIMEDES mous American for a mere two million dollars. Fortunately it was now 
PALIMPSEST in the charge of someone who knew its true value. The most modern 
: scientific techniques have been used to study it and the results are now 


— SS issued in two beautiful volumes by CUP. (More technical and scholarly [i 
[ | volumes will follow.) The first volume gives the background to the stud- [ey 
ies and the second images of the restoration itself. es <3 
| | ALE a 
i i thet | = From the point of view of the mathematician, little more is revealed [Asi 
. == i than was known through the work of Heiberg. However classicists were =! 
oo ee _ thrilled by the discovery of several speeches of Hypereides (one of the ia vie 
a scale major Greek orators), a commentary on Aristotle and several as yet uni- ih 
| dentified fr agments. AD bey 
The Archimedes Palimpsest It is unlikely that many people will fork out 150 pounds to buy these two 4 
Le 


ISBN-13: 9781107014572 volumes, particularly since one of them is in ancient Greek. But those 


$140.00 who do will own a triumph of the art of making books, a triumph of 4} 
the ability of modern science to make darkness visible, a triumph in the FRR AA 
= classicist’s six hundred year struggle to to restore the wisdom of the an- - ~ 
ak cients and a monument to a man who more than two thousand years ago i 


helped lay the foundations of the modern world. ‘History is indeed little 
more than the register of the crimes, follies and misfortunes of mankind’ 
but occasionally we get a glimpse of something better. These volumes 
are, as it were, the concentrated essence of civilisation. 
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Book Reviews 
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Mathematics: A Very Short Introduction 


Timothy Gowers Oxford University Press, 2002 
ISBN 978-0-19-285361-5 £7.99 


A thoroughly entertaining little book that lends itself well to casual 
reading, and which justifies its title wonderfully. While mathemat- 
ics students may find the concepts rather familiar or basic, Gowers’ 
lucid style and simple examples make the content accessible to all. 


Encouraging the reader to think abstractly, the book touches on 
topics such as fractional dimension, hyperbolic geometry and 
uses of mathematical models. Its essentially independent chapters 
can be read separately, but at the same time are neatly unified by 
the underlying philosophical flavour. Some may also find a Fields 
Medallist’s responses to oft-asked questions including “Ts it true 
that mathematicians are past it by the time they are 30?” and “Why 
are there so few women mathematicians?” intriguing, in the last 
chapter. 


This is an insightful bridge between the mathematics taught at 
school and what aspiring students can look forward to, and is rec- 
ommended for anyone with an interest in the subject. Stacey Law 


Just Six Numbers 


Sir Martin Rees Phoenix, 2001 
ISBN: 978-0-75-381022-4 £8.99 


Disregarding how abstract the topic is, a good mathematics book 
should be understood at some level by any reader. When I started 
my undergraduate course I just understood the basics from this 
book, that there are six main numbers that define cosmology: the 
number of dimensions we live in, the ratio of the strength of grav- 
ity to that of electromagnetism, ¢, the ratio of mass lost to energy 
when hydrogen is fused to form helium, ©, describing the amount 
of dark matter, A, the cosmological constant, and Q, related to the 
scale at which the universe looks smooth. 


In time I understood the rest of the book. It is a really good book 
to start with, since Martin Reese has managed to explain the key 
ideas behind cosmology today in 180 pages without any “fuss” 
equations. And it proves that cosmology can be done while having 
a nice cup of tea. Carina Negreanu 


Unitary perfect number and Pronic number. Latitude of 
the North and South poles. Length of a soccer match. 
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Proof of Death 


Chris Pearson Kindle eBook 
ASIN: BOO8U8R20K £0.99 


From a grim scene of hostages in Chechnya to the Great Clock 
in Trinity College, Pearson creates a web of mystery around the 
fictional proof of the Riemann Hypothesis. Weaving together vari- 
ous locations and plots to keep you avidly reading to the end, his 
thriller cleverly incorporates both number theory and its applica- 
tion to cryptography. The plot is skilfully designed so that math- 
ematicians and non-mathematicians alike are sucked into Aslan’s 
world of survival and deceit. 


‘A prime number, of course - no divisors except itself and one - al- 
ways yielded the best brew.” Packed with emotion and description, 
this book is sure to provide a fictional world of mathematical proof 
that any reader can easily delve into. Eleanor Wale, Reading 


Algebraic Number Theory and 
Fermat’s Last Theorem 


lan Stewart, David Tall AK Peters, 2002 
ISBN 1-56881-119-5 £37.99 


It is difficult to find a mathematics book that is both precise and in- 
formal. This book has both qualities, giving historical background 
information while rigorously developing algebraic number theory. 
It is suitable for undergraduates meeting the subject for the first 
time. Definitions are motivated and important concepts are illus- 
trated by computational examples. 


The material in the first 10 chapters is approximately equivalent 
to the Part II Number Fields course, landmarks being ideals, 
Minkowski's Theorem, and the class-group. The remaining chap- 
ters contain the proof of a special case of Fermat's Last Theorem 
(regular prime exponents), which uses all the previously intro- 
duced ideas. They also touch on more advanced topics leading up 
to a sketch proof of its general version. 


The extra material on elliptic curves and elliptic functions has lit- 
tle to do with the rest of the book and feels a bit disconnected. 
However, the chapters on algebraic number theory are excellent 
for accompanying a university course, while the last part will whet 
the reader's appetite for more. Philipp Kleppmann 


Smallest non-trivial cabtaxi number. Smallest 
pseudoprime satisfying 3” = 3 mod n. 
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Archimedeans 


Christmas Catalogue 


Klein Salad Dressing Bottle 
Keep oil on the inside, 
vinegar on the outside 


£20 


Quantum Surfboard 
Dont use on unrestricted 
wavefunctions 


Random Walk Generator 
Comprises stereotypical mathema- 
tician and half-pint of larger. 


Calculus-removing 
toothpaste 
Guaranteed opaque. 


3D Random Walk Generator 


Also includes centrifuge and 
trampoline. 

Accessories: reflecting barriers, absorb- 
ing barriers, extra-absorbing barriers 
for mopping up resulting spills. 


Anthropomorphiser 
Ascribes human qualities and 
emotions to functions, sets, 
numbers etc. Not for use on 
mathematicians! 


Epsilon Magnifier 

Sick of struggling with tiny 
epsilons? The revolutionary new 
epsilon magnifier simplifies analy- 
sis by increasing all epsilons to 
values > 1. 


Set of Pathological Cases 


For the more experienced 
traveller, save money with our 
nowhere-dense set of luggage. 


The Escher Machine 


The ball rolling down an infinite 
slope generates enough energy 
to power a light bulb. 


NEW! Uphill version: uses two 
1.5V AA batteries per day. The 
ideal gift for someone you dislike. 


All items in our catalogue can be 
ordered by writing to 


The Archimedeans 
Q Kolmogorov Street 
X1024 Cantortown 


Number of “atomic elements” in the Look-and-say 
sequence. Atomic number of Uranium. 


Invented by Chris Cummins, Eureka 56 


Compiled by CJ Budd, Eureka 43 


Here you see three different definitions for some obscure mathematical terms. 
It is your task to find — with justification — the correct one. Solutions are on page 94. 


The Tarry Point 


(1) A system of organising movements in a 
Bridge tournament, formalised by Mr Room. 


(2) A dissection of a square into smaller . 
squares of different side lengths (‘rooms’), 
discovered by Trinity students in 1939. 


(3) Tile a cuboid in R” regularly by sub- 
cuboids. Mark certain faces, ensuring that 
no subcuboid has >3 faces marked. Then if 
it is possible to go from one subcuboid to 
another entirely by marked faces, we have a 
room design, where the subcuboids are the 
rooms and the marked faces are the doors. 


Unoer's Translation 


(1) On each face of a tetrahedron construct 
another tetrahedron of side 1/3 of the origi- 
nal. Continue this process for ever, What 
you end up with is a mouse: a finitely small 
yet infinitely furry little animal. 


(2) A mouse is, naturally, a subset of a Cat, 
a connected absorbing topology! A mouse 
is any subset of a cat which has a tail (i.e. 

a proper one dimensional subset. This tail 
must of course be unique and no two mice 
are permitted to have the same tail. 


(3) A premouse is an admissable set with 
an ultrafilter which thinks the ordering it 
gets from the ultrafilter is a well-ordering. 
If the ordering is close enough to let us 
iterate on the ultrafilter we have an iterable 
premouse. If this is well behaved we have 

a critical iterable premouse. A mouse is a 
critical iterable premouse for which every 
sub-premouse is also critical. 


Number of sets with 8 integers so that each is a 93 
proper divisor of the product of the others plus 1. 
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Solutions 


Archimedeans Problems Drive 


1 Dazzling Dice 
Most likely is 6, with probability 16807/46656. 


2 Snappy Surds 
20, 28 and 100. 


3 Painful Primes 
999 917 is prime. 


4 Compelling Convergence 
(a) diverges; 

(b) converges to 77/6; 

(c) converges to tanh ‘(log 2) — log 2. 


5 Superb Sets 

(a) is countably infinite; 

(b) is finite of size 1; 

(c) is uncountable as it bijects with | 
(d) is uncountable as it bijects with | 


we 
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6 Triumphant Treasures 


The treasure is buried on the moon. 


7 Curious Coins 
You want to go first for all n. 


8 Perceptive Polygons 
Yes! 


9 Terrible Triangles 


Call My Bluff 


The Tarry Point: Definition 2 
A Room Design: Definition 1 


Number of graphic non-whitespace characters 
in ASCII. Smith number. 94! - 1 is prime. 


10 Rough Relations 


0 n<3 
f(n) = 2 n=4 
tn(nt+1) n25 


11 Gorgeous Geometry 
Line perpendicular to OD. 


12 Mysterious Matchings 
2 


13 Dazzling Digits 
4000 


14 Cryptic Crossword 


The treasure is buried on the moon. 


Unger’s Translation: Definition 3 
A Mouse: Definition 3 
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